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Abstract 

in 

• This paper presents the Planck likelihood, a complete statistical description of the two-point correlation function of the CMB temperature fluctuations that accounts 
for all known relevant uncertainties, both instrumental and astrophysical in nature. We use this likelihood to derive our best estimate of the CMB angular power 
£~ ^ spectrum from Planck over three decades in multipole moment, t, covering 2 < I < 2500. The main source of error at I < 1500 is cosmic variance. Uncertainties in 
small-scale foreground modelling and instrumental noise dominate the error budget at higher is. For I < 50, our likelihood exploits all Planck frequency channels 
^— H from 30 to 353 GHz, separating the cosmological CMB signal from diffuse Galactic foregrounds through a physically motivated Bayesian component separation 
• • technique. At I > 50, we employ a correlated Gaussian likelihood approximation based on a fine-grained set of angular cross-spectra derived from multiple detector 
^ combinations between the 100, 143, and 217 GHz frequency channels, marginalizing over power spectrum foreground templates. We validate our likelihood through 
? an extensive suite of consistency tests, and assess the impact of residual foreground and instrumental uncertainties on the final cosmological parameters. We find 
good internal agreement among the high-^ cross-spectra with residuals below a few /uK 2 at I < 1000, in agreement with estimated calibration uncertainties. We 
t-H compare our results with foreground-cleaned CMB maps derived from all Planck frequencies, as well as with cross-spectra derived from the 70 GHz Planck map, 
and find broad agreement in terms of spectrum residuals and cosmological parameters. We further show that the best-fit ACDM cosmology is in excellent agreement 
with preliminary Planck EE and TE polarisation spectra. We find that the standard ACDM cosmology is well constrained by Planck from the measurements at 
I < 1500. One specific example is the spectral index of scalar perturbations, for which we report a 5.4 cr deviation from scale invariance, n s ± 1. Increasing the 
multipole range beyond I 1500 does not increase our accuracy for the ACDM parameters, but instead allows us to study extensions beyond the standard model. 
We find no indication of significant departures from the ACDM framework. Finally, we report a tension between the Planck best-fit ACDM model and the low-/ 7 
spectrum in the form of a power deficit of 5-10% at I < 40, with a statistical significance of 2.5-3 cr. Without a theoretically motivated model for this power deficit, 
we do not elaborate further on its cosmological implications, but note that this is our most puzzling finding in an otherwise remarkably consistent dataset. 

Key words. Cosmology: cosmic background radiation - Surveys - Methods: data analysis 
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1. Introduction 

This paper, one of a set associated with the 2013 release of 
data from the Planck 1 mission (Planck Collaboration 1 2013), de- 
scribes the CMB power spectra and the likelihood that we derive 
from the Planck data. 

The power spectrum of the Cosmic Microwave Background 
(CMB) is a unique signature of the underlying cosmolo- 
gical model (e.g., Spergeletal. 2003; Hinshaw et al. 2012). 
It has been measured over the whole sky by COBE and 
WMAP, and over smaller regions by ground-based and sub- 
orbital experiments (e.g., Tristram et al. 2005; Jones et al. 
2006; Reichardt et al. 2009; Fowler et al. 2010; Das et al. 2011; 
Keisler et al. 2011; Story et al. 2012; Das et al. 2013). By map- 
ping the whole sky to scales of a few arcminutes, Planck now 
measures the power spectrum over an unprecedented range of 
scales from a single experiment. To estimate cosmological para- 
meters from the power spectrum requires a likelihood function 
that propagates uncertainties. 

In this paper we describe the power spectra obtained from 
the Planck temperature data, as well as the associated likeli- 
hood function. Since the probability distribution of the power 
spectrum is non-Gaussian at large scales, we follow a hybrid ap- 
proach to construct the likelihood (Efstathiou 2004, 2006), us- 
ing a Gibbs sampling based approach at low multipoles, and a 
pseudo-Q technique at high multipoles (Hivon et al. 2002). 

The high-^ part of the Planck likelihood is based on power 
spectra estimated from each Planck detector in the frequency 
range 100 to 217 GHz, allowing careful assessment of each de- 
tector's response to the sky emission. We implement two inde- 
pendent likelihood methods. The first, used in the distributed 
likelihood code, estimates the power spectrum at every multi- 
pole, together with the associated covariance matrix. The second 
takes a simplified form, binning the spectra, and is used to ex- 
plore the stability of the results with respect to different instru- 
mental and astrophysical systematic effects. The methods give 
consistent results. 

Unresolved extragalactic foregrounds make a significant 
contribution to the power spectra at high multipoles. We de- 
velop a model for these foregrounds, designed to allow the 
Planck likelihood to be combined with high resolution data 
from the Atacama Cosmology Telescope (ACT) and the South 
Pole Telescope (SPT). We combine frequencies and model unre- 
solved foregrounds in a physical way, as in e.g., Shirokoff et al. 
(2010); Dunkley et al. (2011); Reichardt et al. (2012), perform- 
ing component separation at small scales at the power spectrum 
level. On large scales, I < 50, Galactic contamination is more 
significant. We use the Planck temperature maps in the range 
30 < v < 353 GHz to separate Galactic foregrounds in the maps, 
and then estimate the full probability distribution of the CMB 
power spectrum. 

This paper is structured as follows. In Sect. 2 we describe 
the pseudo-Q likelihoods, and in Sect. 3 set up the foreground 
model. The power spectra and derived cosmological parameters 
are presented in Sects. 4 and 5, and an assessment of their accur- 
acy and robustness is made in Sects. 6 and 7. 

In Sect. 8 we describe the low-£ likelihood, and conclude by 
presenting the complete Planck likelihood in Sect. 9. 

1 Planck (http://www.esa.int/Planck) is a project of the 
European Space Agency (ESA) with instruments provided by two sci- 
entific consortia funded by ESA member states (in particular the lead 
countries France and Italy), with contributions from NASA (USA) and 
telescope reflectors provided by a collaboration between ESA and a sci- 
entific consortium led and funded by Denmark. 



Table 1. Detectors used to make the maps for this analysis. 
Spider Web Bolometers (SWB) are used individually; Polarised 
Sensitive Bolometer pairs (PSBs, denoted a and b) are used in 
pairs, and we consider only the maps estimated from two pairs 
of PSBs. The relevant effective beams, and their uncertainties, 
are given in Planck Collaboration VII (2013). 



Set name Frequency Type Detectors FWH1VP 

[GHz] [arcmin] 

100-ds0 100 PSB All 8 detectors 9.65 

100-dsl 100 PSB la+lb + 4a+4b 

100-ds2 100 PSB 2a+2b + 3a+3b 

143-dsO 143 MIX 11 detectors 7.25 

143-dsl 143 PSB la+lb + 3a+3b 

143-ds2 143 PSB 2a+2b + 4a+4b 

143-ds3 143 SWB 143-5 

143-ds4 143 SWB 143-6 

143-ds5 143 SWB 143-7 

217-dsO 217 MIX 12 detectors 4.99 

217-dsl 217 PSB 5a+5b + 7a+7b 

217-ds2 217 PSB 6a+6b + 8a+8b 

217-ds3 217 SWB 217-1 

217-ds4 217 SWB 217-2 

217-ds5 217 SWB 217-3 

217-ds6 217 SWB 217-4 



2. High-l likelihoods 

The Planck maps consist of the order 5 x 10 7 pixels for each de- 
tector, so a likelihood described directly at the pixel level would 
be too time consuming. A significant compression of data can be 
achieved with minimal information loss using pseudo-Q power 
spectra, even in the case of incomplete sky coverage. Here we 
describe the form of the likelihood function of the compressed 
data, given a sky signal and instrumental model. 

Following Hamimeche & Lewis (2008), we assume a 
Gaussian form of the likelihood based on pseudo- spectra that 
have been corrected to account for partial sky masking. We use 
a 'fine-grained' data description, computing spectra of maps 
from individual detectors or detector sets. Table 1 describes the 
13 maps used in the analysis, spanning 100 to 217 GHz. We 
compute the spectra at these multiple frequencies to simultan- 
eously constrain the CMB and foreground contributions. We 
choose these frequencies as a trade-off between adding further 
information, and adding further complexity to the foreground 
model, which would be needed to including the adjacent 70 and 
353 GHz channels (see Sect. 3 for further discussion). In our 
baseline analysis the spectra are computed at each multipole, to- 
gether with an estimate of the full covariance matrix with off- 
diagonal errors between different spectra and multipoles. As in 
the WMAP analysis, we use only cross-spectra between detect- 
ors, alleviating the need to accurately model the mean noise con- 
tribution. 

In this section, we begin with a reminder of the pseudo- 
spectrum approach, and describe our baseline likelihood distri- 
bution, hereafter referred to as the CamSpec likelihood. We then 
show how a compression of spectra within a given frequency can 
be achieved with negligible loss of information. We describe the 
signal and instrument model, including detector noise properties, 
calibration, and beam uncertainties. 

Next, we describe an alternative, simpler, form of the likeli- 
hood, hereafter referred to as Plik, based on binned power spec- 
tra with an inverse- Wishart distribution. This does not require 
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the pre-computation of large covariance matrices, so changing 
the sky or instrument modeling is straightforward. This simpler 
form of the likelihood will be used to assess the robustness of our 
likelihood methodology with respect to technical choices and as- 
trophysical foreground modeling. 

In Sect. 7.5 we also compare these likelihoods to pseudo- 
spectra computed directly from CMB maps estimated by multi- 
frequency component separation (Planck Collaboration XII 
2013). 

2.1. The CamSpec likelihood 

We define t l im as the spherical harmonic coefficients of the 
weighted temperature map of detector i. The pseudo-spectrum 
at multipole I, for the detector pair (/, j), is then given by 

m 

where the dagger, f , denotes the Hermitian transpose. This is re- 
lated to the 'deconvolved' spectrum, C Tij , by a coupling matrix, 

C Ti ' = M]JC Ti >. (2) 

For an isotropic signal on the sky, the ensemble average of these 
deconvolved spectra are equal to the spectra of the theoretical 
models (including CMB and isotropic unresolved foregrounds) 
that we wish to test. For completeness, the coupling matrices are 
given explicitly in Appendix A. 1 . 

In the first method, CamSpec, we form the deconvolved spec- 
tra Q without any prior smoothing of the pseudo- spectra Q. 
Even for the largest sky masks used in our analysis (see Sect. 3), 
the coupling matrices are non- singular. The deconvolution re- 
quires the evaluation of ~ A^ ap coupling matrices for a data set 
with A^p sky maps, which takes a moderate, but not excessive, 
amount of computer time. 

A more challenging computational task is to compute the 
covariances of the pseudo- spectra, i.e., Cov(C Tij C Tpq ). Here we 
need to compute A^ ap coupling matrices, and the problem rap- 
idly becomes computationally intractable even for relatively low 
values of N map . For the moment we will assume that these cov- 
ariance matrices are available and describe their computation in 
Appendix A.4. We will use the notation X = Vec(C) to denote 
a column vector for which the index p of a single element X p 
denotes the map combination (/, j) and multipole £. We denote 
the covariance matrix of this vector as 

A( = (X-<X))(X-<X)) r . (3) 

As explained later, the deconvolved detector set cross- 
spectra given by Eq. 2 can be efficiently combined within a 
given frequency pair after a small effective recalibration, taking 
into account their respective isotropised beam transfer function 
and noise levels (see Appendix A. 3 for the detailed procedure). 
Covariance estimates of these combined spectra can be deduced 
from those of the detector set cross- spectra. The covariance mat- 
rix is computed for a fixed fiducial model, and we approximate 
the likelihood as a Gaussian, described in Appendix A.5. The 
likelihood thus takes the form p = e~ s with 

S = \(X-X) T M-^X-X). 

For the current analysis we include the following (decon- 
volved) spectrum combinations, 

X = (c 100xl0 ° c U3xl43 c 217x217 c l43x2il ) (4) 



coupled to a parametric model of the CMB and foreground 
power spectra. The multipole ranges we select depend on fre- 
quency, as described in Sect. 5. We do not include the 100 x 143 
and 100 x 217 spectra since these spectra carry little additional 
information about the primary CMB anisotropies, but would re- 
quire us to solve for additional unresolved foreground paramet- 
ers. This tradeoff of information versus complexity was also con- 
sidered for the use of the 70 GHz and 353 GHz data, which we 
choose not to include except for cross-checks. 

The fiducial covariance matrix is composed of the blocks 
shown in Fig. 1. The off-diagonal blocks in this matrix accur- 
ately account for the correlations between the power spectra at 
different frequencies. 

This description would be sufficient for perfectly known 
calibrations and beam transfer functions of each detector sets' 
cross- spectra. Planck Collaboration VII (2013) describes in de- 
tail these uncertainties, and shows that for each detector set pair, 
(/, j), the effective beam transfer function can be expressed as 

( ^mode s \ 

described further in Appendix A. 6, with ft mo des beam error ei- 
genmodes E l ^(£), and their covariance matrix. These modes are 
then combined into generalised beam eigenmodes correspond- 
ing to the spectra X. The associated covariance matrix is used 
to construct a Gaussian posterior distribution of the eigenmodes, 
which allows marginalization over the uncertainties. 

Finally, in the construction of the covariance matrix, one 
needs to accurately specify the contribution of the instrumental 
noise. Even if there is no bias on the spectra due to instru- 
mental noise, having removed auto-spectra, the latter dominates 
the covariance matrix on small scales. Fortunately, the Planck 
scanning strategy at the ring level allows us to make estim- 
ates of the noise pseudo-spectra from half-ring difference maps 
(see Planck Collaboration XII 2013). These half-ring difference 
maps, together with the knowledge of the noise variance per 
pixel for each detector set, can be used to derive the noise 
contribution to the covariance matrix with good accuracy (see 
Appendix A. 8 for details). 

2.2. The Plik likelihood 

We now describe the alternative form of the likelihood, inspired 
by Cardoso et al. (2008), used for cross-checks and robustness 
tests. We start from the full-sky exact likelihood for a Gaussian 
signal, which for N map detector maps is given by 

/>(maps|0) oc exp ~{J](2t + 1)7C(Q, Q(0))}, 

i 

where is a vector containing the parameters of the signal 
model, and Q are the empirical angular spectra. 7C(A, B) de- 
notes the Kullback divergence between two ft-variate zero-mean 
Gaussian distributions with covariance matrices A and B, and is 
given by 

7C(A, B) = ift^AB" 1 ) - logde^AB" 1 ) - n\ 

As already noted, a sky cut introduces off-diagonal couplings 
between different multipoles. In this method we bin the power 
spectra in such a way that these off-diagonal terms of the cov- 
ariance are negligible. This is adequate to model sources with 
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M = 



(100 x 100) x (100 x 100) 



(143 x 143) x (100 x 100) 



(217x 217)x(100x 100) 



(143 x 2 17) x (100 x 100) 



(100 x 100) x (143 x 143) 



(143 x 143) x (143 x 143) 



(217 x 217) x (143 x 143) 



(143 x 217) x (143 x 143) 



(100 x 100) x (217x 217) 



(143 x 143) x (217x 217) 



(217 x 217) x (217x217) 



(143 x217)x (217x217) 



(100 x 100) x (143x217) 



(143 x 143) x (143x 217) 



(217 x 217) x (143x217) 



(143 x 217) x (143x 217) 



Figure 1. The covariance matrix blocks used in the likelihood, accounting for the correlations between cross-spectra estimated from 
the 100, 143, and 217 GHz channels. 



slowly varying spectra, such as foregrounds, and the CMB an- 
isotropics for standard cosmologies. In this case, the likelihood 
takes the form 

Q 

/?(maps|0) oc exp -£(6), with £(0) = ^ n q 7C(C q , C q ), 

q=l 

where the angular spectra Q for each cross-frequency spectrum 
have been averaged into Q spectral bins using spectral windows 
w q (e)(q=l,...,Q),vnih 



Here w q {€) denotes the window function for the q-th bin, and the 
same symbol, C, is used to denote binned or unbinned spectra. 
The effective number of modes in the q-th bin is 



Kg — fsky 



^w q {tfi(2t + \y 

We adopt a spectral binning defined by 

( l(l+l)(2l+l) nq < P < 

min — — max ' 

otherwise. 



\ min 

o 



The Plik bin width is A€ = 9 from I = 100 to I = 1503, then 
M = 17 to I = 2013, and finally M = 33 to 4iax = 2508. This 
ensures that correlations between any two bins are smaller than 
10%. 

While this binned likelihood approximation does not fully 
capture all couplings between different multipoles, it has a not- 
able advantage in computational speed, and it agrees well with 
the primary likelihood. It is therefore very well suited for per- 
forming an extensive suite of robustness tests, as many more 
parameters can be considered in a short time. Further, instru- 
mental effects can be investigated quickly to assess the agree- 
ment between pairs of detectors within a frequency channel, 
such as individual detector calibrations and beam errors. 

A specific example is the impact of beam uncertainty para- 
meters on the likelihood. This can be investigated by re- 
expressing the model covariance matrices as 



where Q(0) is the model covariance including both signal and 
noise, and B^(y) is a diagonal matrix encoding the beam (and 
calibration) errors with elements given by 2 



I "mooes 

Bj(y) = exp 24^(0 



k=\ 



(8) 



Here, E l £(€) are the eigenmodes of the (auto-)spectra, similar to 
Eq. 5. Note that Eq. 7 does not contain the mean beam transfer 
function, since it is already included in the empirical spectra. 
Thus, using Eq. 7 Plik approximates the cross-spectrum beam 
errors as the harmonic mean of the corresponding auto- spectrum 
beam errors, under the assumption that is diagonal between 
detectors. This approximate factorisation is intrinsically linked 
to the assumed Kullback shape of the Plik likelihood, and is 
later demonstrated to work well for both simulations and data. 

The Plik likelihood method also provides a direct estimate 
of the detector noise power spectra as it can include the empirical 
auto- spectra, and we find that these noise estimates are in good 
agreement with the noise spectra used to construct the CamSpec 
likelihood covariance matrix. The method can also produce a 
binned CMB power spectrum independent of the underlying cos- 
mological model, providing a direct quality assessment of the 
foreground model parametrisation. In practice, we proceed in 
two steps. First, we jointly estimate the noise together with all 
other parameters using both auto and cross-spectra. Then we fix 
the noise estimates, and use the fiducial Gaussian approxima- 
tion to explore the remaining free parameters excluding the auto- 
spectra, optionally including only specific data combinations. 

3. Foreground emission model and sky masks 

3.1. Sky masks 

The Galactic emission varies strongly in both complexity and 
strength across the sky. It is therefore necessary to find a bal- 
ance between maximizing the sky coverage to reduce statistical 
uncertainties, and establishing a simple yet efficient foreground 
model. In this paper, we threshold the 353 GHz temperature 
map to define a basic set of diffuse Galactic masks (shown in 
Appendix B), which form a sequence of increasing sky fraction, 
to minimize the contribution from diffuse dust emission. The sky 
fractions retained by these masks is summarized in Table 2. 

For Planck, we need to estimate the covariance matrices 
to percent level precision. For temperature spectra, and in the 



Q = B,(y)Q(0)B,(y) T 



(7) 



From Eqs. 5, 7, and 8, we have 6 l k = g%/2 at first order. 
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Table 2. Area of sky retained by combining diffuse foreground 
and point source masks, once apodised. 



Mask 


Sky fraction 


Sky area 




[%] 


[deg 2 ] 


CL31 


30.71 


12 668 


CL39 


39.32 


16223 


CL49 


48.77 


20121 




Figure 2. The set of masks (CL31, CL39, CL49) used for the 
likelihood analyses. 



absence of point source holes, this precision can be achieved 
with sharp, non-apodised Galactic masks (Efstathiou 2004). 
However, the inclusion of point source holes introduces non- 
negligible low-£ power leakage, which in turn can generate 
errors of a few percent in the covariance matrices. We re- 
duce this leakage by apodising the diffuse Galactic masks (see 
Appendix B for details). The point source mask is based on the 
union of the point sources detected between 100 and 353 GHz, 
and is also apodized. The point source flux cut is not critical, 
since the amplitudes of the Poisson contributions of unresolved 
sources are allowed to vary over a wide range in the likelihood 
analysis. Thus, we do not impose tight priors from source counts 
and other CMB experiments on the Poisson amplitudes. A set of 
the combined Galactic and point source masks, referred to as 



'CLx', where 'x' is the percentage of sky retained, are shown in 
Fig. 2. 

3.2. Galactic emission 

The contamination from diffuse Galactic emission at low to in- 
termediate multipoles can be reduced to low levels compared to 
CMB anisotropies by a suitable choice of masking. However, 
even with conservative masking, the remaining Galactic emis- 
sion at high multipoles is non-negligible compared to other un- 
resolved components, such as the Cosmic Infrared Background 
(CIB) anisotropies at 143 and 217 GHz. A clear way of demon- 
strating this is by differencing the power spectra computed with 
different masks, thereby highlighting the differences between 
the isotropic and non-isotropic unresolved components. Figure 3 
shows (up to £ < 1400) the 217 GHz power spectrum difference 
for the maskl and maskO masks 3 , minus the corresponding dif- 
ference for the 143 GHz frequency channel. Any isotropic con- 
tribution to the power spectrum (CMB, unresolved extragalactic 
sources, etc.) will cancel in such a double difference, leaving a 
non-isotropic signal of Galactic origin, free of the CMB induced 
cosmic variance scatter. Above £ > 1400, Fig. 3 shows the mask 
differenced 217 GHz power spectrum, as the instrumental noise 
becomes significant at £ > 1400 for the 143 GHz channel. 

In the same figure, these difference spectra are compared to 
the unbinned mask-differenced 857 GHz power spectrum, scaled 
to 217 GHz adopting a multiplicative factor 4 of (9.93 x 10" 5 ) 2 ; 
the dotted line shows a smooth fit to the unbinned spectrum. 
The agreement between this prediction and the actual dust emis- 
sion at 217 GHz is excellent, and this demonstrates conclusively 
the existence of a small-scale dust emission component with an 
amplitude of ~ 5 - 15 jjK 2 at 217 GHz if maskl is used. 

For cosmological parameter analysis this small-scale dust 
component must be taken into account, and several approaches 
may be considered: 

1. Fit to a template shape, e.g., as shown by the dotted line in 
Fig. 3. 

2. Reduce the amplitude by further masking of the sky. 

3. Attempt a component separation by using higher frequen- 
cies. 

The main disadvantage of the third approach is a potential 
signal-to-noise penalty, depending on which frequencies are 
used, as well as confusion with other unresolved foregrounds. 
This is particularly problematic with regards to the CIB, which 
has a spectrum very similar to that of Galactic dust. In the fol- 
lowing we therefore adopt the two former solutions. 

It is important to understand the nature of the small scale dust 
emission, and, as far as possible, to disentangle this emission 
from the CIB contribution at the HFI cosmological frequencies. 
We use the 857 GHz power spectrum for this purpose, noting 
that the dust emission at 857 GHz is so intense that this partic- 
ular map provides an effectively noise-free dust emission map. 
In Fig. 4 we again show the 857 GHz mask power spectrum dif- 
ference, but this time plotted on a log-log scale. The solid line 
shows the corresponding best-fit model defined by 

_ A(100/£) a 

T> f = — — (9) 

3 These are the combination of the non-apodised Galactic masks G35 
and G22 with the apodised point source mask PSA82. 

4 The scaling coefficient for the 143 GHz spectrum is (3.14 x 10~ 5 ) 2 , 
derived from the 7-parameter fitting function of Eq. A.46. 
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Figure 3. Differences between power spectra evaluated from 
masks 1 and maskO, showing the presence of Galactic dust. 
For £ < 1400 the spectra show the 217-143 "double- 
differenced" power spectrum, rescaled to correct for dust emis- 
sion at 143 GHz. For £ > 1400 the 217 mask differenced power 
spectrum is plotted. The blue line shows the 857 GHz mask- 
differenced power spectrum scaled to 217 GHz as described in 
the text, fit by the dotted line. 
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Figure 4. 857 GHz mask-differenced power spectrum (points), 
interpreted as Galactic dust emission. The solid line shows the 
best- fit model defined by Eq. 9. 

with A = 5.729 x 10 VK 2 , a = 0.169, £ c = 905, and y = 
0.427. At high multipoles this fit asymptotically approaches 
Q oc £~ 2 - 6 , which is compatible with previous knowledge 
about diffuse Galactic emission, i.e., a power-law behaviour with 
an index close to -3 extending to high multipoles (see e.g., 
Miville-Deschenes et al. 2007). 

The upper panel in Fig. 5 shows the 857 GHz spectra for the 
four Galactic masks (G22, G35, G45 through to G56) with the 
point-source mask applied. They are compared to the 857 GHz 
CIB power spectrum from Planck Collaboration et al. (2011c), 
which for I > 500 can be described approximately as £Ci ~ 
7 x 10 6 yuK 2 . The best fit models of Eq. 9 are also shown, fitted to 



£ < 500, where we expect diffuse emission to be dominant. The 
lower panel of Fig. 5 shows the same power spectra after sub- 
tracting the best-fit dust model. After subtracting the Galactic 
dust component, the recovered power spectra are consistent with 
the CIB measured in Planck Collaboration et al. (2011c) for all 
masks. The excess at high multipoles may be due to a combin- 
ation of aliasing of large scale power through the point source 
masks at £ > 3000, Galactic point sources, and uncertainties in 
the 857 GHz beams. 

The model explains by construction the 'double-difference' 
plot shown in Fig. 3. Specifically, this emission is consistent with 
a cirrus-like power spectrum, Q oc £~ 2 - 6 , extrapolated to high 
multipoles. Furthermore, the results of Fig. 5 demonstrate that 
over a wide area of sky, we can understand the 857 GHz power 
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Figure 5. Top: 857 GHz power spectra for the four different 
masks defined in the text. The dotted lines show the best-fit 
model defined by Eq. 9 fit to £ < 500, capturing the Galactic 
dust. An estimate of the CIB power spectrum is shown in or- 
ange points Planck Collaboration et al. (2011c). Bottom: Power 
spectra after subtracting the Galactic dust model. 
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spectrum in terms of a 'universal' cirrus spectrum together with 
an isotropic CIB component. These results provide strong evid- 
ence that an extragalactic CIB component dominates over the 
diffuse Galactic emission at multipoles £ > 500 over the full 
range of HFI frequencies outside the CL3 1 mask. 

We take a different approach for the Galactic dust correction 
with the Plik likelihood. Rather than correcting the empirical 
spectra during a pre-processing step, the Plik likelihood imple- 
ments an explicit one-parameter model that describes the dust 
contribution to the cross-spectrum between detectors i and j, 



C?»%, j) = A Dust F(v h v )F(vjV ) ( — 



^Dust Dust 
5f 5j ■ 



Here 



F(v v ) : 



v^ d B(T d 



dB(T CM B,y) 
dT 



v) yfi«B(T d ,vo) 

1 dB(T CMB ,v ) ' 
dT 



(10) 



(11) 



where the dust amplitude, A Dust , is measured in units of yuK 2 , 
Vi is the reference frequency for map i, vo is a reference fre- 
quency which is taken to be 143 GHz, B(T, v) is the emission 
law of a blackbody with temperature T, and the dust color- 
correction terms, gf ust , are computed by integrating the dust 
spectrum within the spectral band of each detector (set). We 
fix the frequency and angular scaling parameters to y d - 2.6, 
p d = 1 .6 and T d = 18K. 



3.3. Poisson power from unresolved point sources 

Unresolved galaxies contribute both shot noise and clustered 
power to the Planck maps. The Poisson contribution leads to a 
scale independent tern, Q = constant. We model this power with 
a single amplitude parameter for each auto-spectrum (A^J , A^ 3 , 
and and a cross correlation coefficient for each cross spec- 
trum (A^4 3x217 = r C iB y/(A™A™)). These quantities are not 
of primary interest for cosmological results, so to avoid mod- 
elling error we do not separate the power into that sourced by 
"dusty" or "radio" galaxies (i.e., with increasing or decreasing 
brightness with frequency, respectively) as is done in the ana- 
lysis of the ACT and SPT power spectra (Dunkley et al. 2011; 
Reichardt et al. 2012; Dunkley et al. 2013). We also make no as- 
sumptions about their coherence between frequencies. 

The Poisson power can be related to the flux density dN/dS 

via 



C f 



U dh L 



>Scut(«) 



dS s z 



dS' 



(12) 



where we have explicitly introduced the Planck flux cut S cu t(h). 
Since Planck utilizes a constant signal-to-noise cut, and the 
Planck noise varies significantly across the sky, this flux cut has a 
spatial dependence. Although this does not alter the shape of the 
Poisson term, extra care must be taken when comparing results 5 
with models of dN/dS . In Sect. 7.3, we explore the consistency 
between the Poisson power recovered from the Planck power 
spectrum analysis and predictions from source count measure- 
ments. 



5 One must also account for the fact that these numbers corres- 
pond to the amplitude for a suitably averaged spectral band, which 
is approximately that of the map, and is described in detail in 
Planck Collaboration IX (2013). 



3.4. Clustered power from unresolved point sources 

Unresolved galaxies also contribute power because they trace 
large-scale structures. The mean flux from the radio galax- 
ies is much smaller than that from the dusty galaxies, so 
only the dusty galaxies contribute a significant clustering term 
(Millea et al. 2012). The CIB clustering has been studied ex- 
tensively, starting with Bond et al. (1986, 1991). Further the- 
oretical investigation (Scott & White 1999; Haiman & Knox 
2000) was stimulated by the detection of the infrared back- 
ground in the COBE data (Puget et al. 1996; Fixsen et al. 
1998), and the detection of bright "sub-millimeter" galaxies 
in SCUBA data (Hughes et al. 1998). Subsequently, the clus- 
tering has been detected at 160 microns (Lagache et al. 2007), 
at 250, 350 and 500 microns by the Balloon-borne Large 
Aperture Submillimeter Telescope (BLAST, Viero et al. 2009; 
Hajian et al. 2012) and at 217 GHz by SPT and ACT (Hall et al. 
2010; Dunkley et al. 2011). Recent Planck measurements of 
the Cosmic Infrared Background (Planck Collaboration et al. 
2011c) have extended the measurements at 217 GHz, 353 GHz, 
and 545 GHz to larger scales, and recent Herschel measurements 
(Amblard et al. 2011) have improved on the BLAST measure- 
ments and extended them to smaller angular scales. 

Rather than attempt to establish a physical model of the CIB, 
we adopt in this analysis a phenomenological model that cap- 
tures the CIB uncertainties for both Planck and high-^ experi- 
ments. Our baseline CIB model is a power-law spectrum with 
a free spectral index, oc £y , with an amplitude at each 

frequency, A™ and A™ , and a cross-correlation between fre- 
quencies, A™ x217 = r C iB V^™^217 )• We assume mat me CIB 
clustering power at 100 GHz is negligible. 

3.5. Unresolved Sunyaev-Zeldovich effects 

Based on analysis of ACT and SPT data, the thermal Sunyaev- 
Zeldovich (tSZ) contribution is expected to contribute ap- 
proximately D^ooo ~ 9//K 2 at 100 GHz and Df= 3000 ~ 
at 143 GHz (Reichardt et al. 2012; Dunkley et al. 2013; 
Sievers et al. 2013). The kinetic Sunyaev-Zeldovich (kSZ) ef- 
fect is expected to have a similar, or smaller, contribution, 
with £^3 000 £ 7//K 2 . In addition, theoretical arguments 
(Reichardt et al. 2012; Addison et al. 2012c) suggest that there 
should be a tSZ x CIB correlation that should contribute about 
the same order of magnitude as the kSZ term at 143 GHz. 

For Planck, all of these SZ contributions are small in com- 
parison to other unresolved foregrounds and are therefore poorly 
constrained by Planck data alone. Nevertheless, to eliminate bi- 
ases in cosmological parameters (Millea et al. 2012; Zahn et al. 
2005), we model their contributions, with appropriate con- 
straints from higher resolution CMB experiments, using three 
templates. 

First, for the thermal SZ effect we adopt the family of 
templates described by Efstathiou & Migliaccio (2012). These 
are based on the Komatsu & Seljak (2002) model, but use the 
'universal' X-ray electron pressure profile, P e , of Arnaud et al. 
(2010) extrapolated to high redshift via 



p e (z) oc [(i - n A )(i + z f + q a ] 4 / 3 -/ 2 . 



(13) 



Here, e describes departures from self-similar evolution, and a 
value of 6 = 0.5, which is adopted as the default for para- 
meter estimation purposes, provides a good match to the results 
from recent hydrodynamical numerical simulations incorporat- 
ing feedback processes (Battaglia et al. 2010, 2012). Figure 6 
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Figure 6. SZ templates at 143 GHz computed for a normaliz- 
ation of cr 8 = 0.8. The tSZ templates are from the model of 
Efstathiou & Migliaccio (2012) for three values of the evolu- 
tion parameter 6, e = (top), e = 0.25 (middle) and e = 0.5 
(lower). The kinetic SZ template is from Trac et al. (2011). 
The tSZxCIB cross correlation (143 x 143 GHz) is from the 
Addison et al. (2012c) template with parameters described in the 
text, and is negative for 143 x 143 GHz. 



shows the tSZ templates for three values of e; the template shape 
is not particularly sensitive to e. We treat the (dimensionless) 
normalization of the tSZ template at 143 GHz as an adjustable 
parameter, 

(14) 



^emplate 

t 



This parameter fixes the amplitude at 100 GHz via the frequency 
dependence of the tSZ effect, 



X = 



hv 
kf' 



(15) 



We neglect the tSZ at 217 GHz. 

Second, for the kinetic SZ effect we adopt the template de- 
scribed by Trac et al. (201 1), and as in Eq. 14 we treat the dimen- 
sionless amplitude of the template, A kSZ , as a free parameter, 



£)kSZ _ ^kSZ^template 



(16) 



Third and finally, for the cross-correlation between the 
thermal SZ component and the CIB we adopt the template de- 
scribed by Addison et al. (2012b). In this case, the amplitude is 
parameterised in terms of a single correlation coefficient, 



,tSZxCIB 



£)tSZxCIB 



'U^I^T^J^ (143 x 143), 
-^^T^Tf^I^ (143x217). 

(17) 

These templates are plotted in Fig. 6, normalized to erg = 0.8 
and with £ = 1.0 using a fiducial CIB amplitude. Note that with 
these parameters, the tSZxCIB cross- spectrum approximately 
cancels the kSZ spectrum at 143 GHz. 

As seen in Fig. 6, the SZ contributions are at the level of 
a few yuK 2 , which, although small, must be taken into account 



to assess inter-frequency residuals. However, one can see that 
these templates have similar shapes at multipoles < 2000, and 
therefore they cannot be disentangled using Planck data alone. 
On the other hand, higher resolution experiments can break 
this degeneracy, and as shown in Planck Collaboration XVI 
(2013), the combination of Planck, ACT, and SPT, better con- 
strains the amplitude of the thermal SZ effect. The ACT and 
SPT data at 150 GHz can be fitted to high (sub-//K 2 ) accur- 
acy without kSZ and tSZxCIB templates, yet we expect a kSZ 
contribution of at least the amplitude shown in Fig. 6, and 
larger if we account for patchy reionization (see, e.g., Knox 
2003, and references therein) and references therein). This im- 
plies a cancellation of the kSZ and tSZxCIB contributions at 
150 GHz (Addison et al. 2012c), as discussed in greater detail in 
Planck Collaboration XVI (2013). 



4. Combined cross-spectra and consistency checks 

The large number of cross spectra in a detector-by-detector 
power spectrum analysis allows for a number of internal consist- 
ency checks of the data. Within a frequency band, we expect to 
see exactly the same sky signals (primordial CMB, Galactic, and 
extra-galactic foregrounds), and so any intra-frequency residuals 
will reflect instrumental systematics, for example beam errors, 
'gain' fluctuations, and band-pass mismatch. In contrast, inter- 
frequency residuals are harder to analyse because the sky sig- 
nals vary with frequency. An accurate model of the unresolved 
foregrounds is therefore required to assess inter-frequency re- 
siduals. Furthermore, as we will show below, the scatter caused 
by chance CMB-foreground cross-correlations can dominate the 
inter- frequency residuals. For a precision experiment such as 
Planck, where the power spectra are expected to be signal dom- 
inated over a wide multipole range, intra- and inter-frequency 
residuals provide a powerful way of assessing possible system- 
atic errors. It is essential that contributions of systematic errors 
to both types of residual are small enough that they have negli- 
gible impact on cosmological parameter analysis. 

Figure 7 shows a selection of temperature cross-spectra and 
estimates of the analytic covariance matrices, together with the 
best-fit cosmological model described in Sect. 5. Unresolved 
foregrounds have been subtracted using the best-fit foreground 
parameters of the model described in Sect. 3. The scatter var- 
ies substantially between cross-spectra, reflecting differences in 
the instrument noise and effective resolution of different de- 
tector combinations. The analytic error model summarized in 
Appendix A. 2 is indicated, modified by the non- white noise cor- 
rection. This model provides an excellent description of the scat- 
ter seen in the data, over the full multipole range shown in plots, 
with an accuracy of a few percent or better. 



4.1. Intra-frequency residuals 

In this section we analyse the intra-frequency residuals at 143 
and 217 GHz. There are Af spe c = 10 cross-spectra at 143 GHz and 
15 cross-spectra at 217 GHz 6 . At each frequency, we solve for 
multiplicative ('effective' calibration) coefficients, y*, that min- 
imise 



^ 2 = ZZ^-<Q» 2 , 

£ ij,j>i 



(18) 



there is only one cross-spectrum at 100 GHz 
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Figure 7. A selection of cross spectra from individual detectors, with the best fit unresolved foreground model subtracted. The 
best-fit six parameter ACDM model is shown, with ±l<x errors determined from the diagonal components of the analytic co variance 
matrices. The lower panel in each plot shows the residuals with respect to the model. 



where 

^> = F-Zw^ (19) 

yVspcc ijj>i 

subject to the constraint that y\ = 1 (where i = 1 corresponds 
to detector 5 at 143 GHz and detector 1 at 217 GHz). Note that 
the power spectra in Eq. 18 and 19 are corrected for beam trans- 
fer functions. To minimise the possible impact of beam errors 
and noise, we restrict the sum in Eq. 1 8 to the multipole range 
50 < i < 500 where the spectra are signal dominated. Numerical 
values for the calibration coefficients are given in Table 3, using 



mask CL3 1 . The calibration factors are insensitive to the choice 
of mask or multipole range. 

The results of Table 3 show that effective calibration factors 
of -0.2 % are quite typical for HFI maps, in the 100-217 GHz 
frequency range. These recalibrations are of the order of mag- 
nitude of the statistical errors of the calibrations on dipole (see 
Planck Collaboration VIII 2013, Table 2). Note that the data 
are corrected for individual bolometer time transfer functions 
(TTFs; Planck Collaboration VII 2013). For each detector, the 
TTF model is tuned to minimise survey differences and by con- 
struction normalised to unity at the spin frequency of the satellite 
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Table 3. Map calibration coefficients. 



map 




map 




143-5 


1.0000 


217-1 


1.0000 


143-6 


0.9988 


217-2 


0.9992 


143-7 


0.9980 


217-3 


0.9981 






217-4 


0.9985 


143-dsl 


0.9990 


217-dsl 


0.9982 


143-ds2 


0.9994 


217-ds2 


0.9975 



(0.01666 Hz) to preserve the dipole calibration. The consistency 
of intra-frequency power spectrum residuals therefore provides 
a test of the consistency of the TTFs in addition to the beam 
transfer functions. 

Figure 8 shows the remarkable consistency of the power 
spectra at each frequency. The upper panels show the spectra 
corrected for the beam and effective calibration, together with 
the mean cross spectra. The lower panels show the dispersion 
around the mean. In the signal dominated regime the cross spec- 
tra show an RMS dispersion of a few pK 2 (in bands of width 
A£ = 31), i.e., the band-averaged spectra are consistent to an ac- 
curacy of -0.1-0.2 %. This excess scatter has negligible impact 
on cosmological parameter analysis. 

The residuals of the cross spectra in band averages of width 
A£ ~ 61 are shown in Fig. 9, before and after correction for the 
effective intra-frequency calibrations. The reduction in scatter 
after correction is evident at £ < 500, and the residual scatter is 
consistent with instrument noise and beam errors. At 217 GHz, 
beam errors dominate over noise at multipoles < 1000. There is 
no evidence that the excess scatter is caused by a small number 
of 'anomalous' detectors. 

4.2. Inter-frequency residuals 

The results of the previous section show that the intra-frequency 
cross-spectra between detector/detector sets are consistent to 
within a few pK 2 at multipoles £ < 1000. In a likelihood ana- 
lysis, there is therefore little loss of information in compressing 
the power spectra for each distinct frequency combination, as 
opposed to retaining the spectra for each map pair. This com- 
pression greatly reduces the size of the data vector and its co- 
variance matrix, and speeds up the likelihood computation at 
high multipoles. In this section we inter-compare the residuals 
of these compressed power spectra. 

One might naively expect that with accurate foreground 
modelling, the inter-frequency residuals in the signal dominated 
regime should be reduced to levels comparable to those seen in 
the intra-frequency comparisons described in the previous sec- 
tion. This is incorrect. Figure 10 shows power spectrum differ- 
ences between the cosmologically significant spectra for Planck 
at high multipoles (143x143, 143x217, 217x217). In this figure, 
which is independent of the cosmological model, the best-fit un- 
resolved foreground model has been subtracted from each spec- 
trum, and relative calibration factors have been applied. Residual 
beam, calibration and unresolved foreground errors would show 
up in this figure as large-scale smooth residuals. 

In fact, we see small-scale residuals at multipoles £ < 
800 which are considerably larger than expected from instru- 
mental noise. This excess scatter arises from chance CMB- 
foreground cross-correlations. Even if the foreground contamin- 
ation is much smaller than the CMB, chance cross-correlations 
can produce scatter in the inter-frequency power spectra that 



6000 





I 

Figure 8. Cross spectra corrected for the beam and effective 
calibration, together with the mean power spectrum. Top: The 
10 cross spectra at 143 GHz. Bottom: The 15 cross spectra at 
217 GHz, with SWBxSWB spectra (magenta), SWBxdsl (red), 
SWBxds2 (green), dslxds2 (purple). The power spectra are dis- 
tinguishable only at high multipoles where the data become 
noise dominated. The lower panels show the dispersion of the 
cross spectra around the mean, together with a ±0.2% calibra- 
tion error. 



is large in the signal dominated regime. We demonstrate in 
Appendix C that the observed scatter can be predicted quant- 
itatively. 

At high enough multipoles, instrument noise, beam er- 
rors, and errors in foreground modelling dominate the inter- 
frequency residuals. A complete analysis of inter- frequency re- 
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siduals therefore requires the full likelihood machinery and 
MCMC analysis to determine foreground, beam and calibration 
parameters. We will therefore revisit the inter- frequency resid- 
uals in the following sections. 

5. Reference results of the high-l likelihood 

In this section we study the high-/ 7 Planck likelihood, and present 
the power spectrum and parameters derived from the baseline 
likelihood for the basic six-parameter ACDM model. In order 
to break the well-known degeneracy between the optical depth, 
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Figure 9. Cross spectra for the 143 GHz (top) and 217 GHz (bot- 
tom) channels, as in Fig. 8, before correction for multiplicative 
intra-frequency calibration coefficients (above), and after correc- 
tion (below). 
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Figure 10. Differences between the 143 x 143, 143 x 217, and 
217 x 217 cross spectra (plotted in bins of width S£ « 31). The 
best-fit model for unresolved foregrounds has been subtracted 
from each spectrum. The numbers list the dispersions over the 
multipole range 800 < £ < 1500. 



r, and the scalar index of scalar perturbations, n s , we adopt a 
Gaussian prior on r (inspired from WMAP7 data, i.e., 0.088 ± 
0.015, see Komatsu et al. 2011) instead of the low-/ 7 likelihood 
at £ < 50. We return to the global Planck results after introducing 
the low-£ likelihood. 

We choose separate masks for each frequency map to min- 
imise Galactic foreground emission. First, since the HFI data are 
signal-dominated at < 500, and diffuse Galactic emission is low 
at 100 GHz outside the CL49 mask, there is little to be gained 
from analyzing the dustier 217 GHz with the same region; it con- 
tains no new information about the primordial CMB. At higher 
multipoles, £ > 500, we use the CL31 mask, optimizing the sky 
coverage while ensuring a low amplitude of small-scale Galactic 
emission relative to the isotropic unresolved foregrounds and the 
primordial CMB. In addition, we tune the multipole range for 
each frequency to mitigate Galactic foreground contamination 
and beam errors. 

The choices of masks and angular ranges used in the high- 
£ likelihood are summarised in Table 4, together with basic x 2 
statistics with respect to the minimal ACDM model per cross- 
spectrum and combined. The 100 GHz cross-spectrum is com- 
puted over the largest sky fraction, a total of 49% of the sky, and 
measures the largest scales. On the other hand, it has lowest res- 
olution, and it is therefore only used for £ < 1200. The 143 GHz 
cross- spectrum has higher resolution, and is used for £ < 2000. 
Finally, the 217 GHz cross-spectrum has the highest resolution, 
but also the most Galactic dust contamination, and is therefore 
evaluated from only 31% of the sky, but including an angular 
rangeof500 <^< 2500. 

Given these masks and angular ranges, we compute the an- 
gular power spectra and covariance matrices, and construct the 
CamSpec likelihood. The angular power spectra for each fre- 
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2500 



Figure 11. Planck power spectra and data selection. The coloured tick marks indicate the Grange of the four cross-spectra included 
in CamSpec (and computed with the same mask, see Table 4). Although not used, the 70 GHz and 143 x 353 GHz spectra demonstrate 
the consistency of the data. The dashed line indicates the best-fit Planck spectrum. 



Table 4. Overview of of cross- spectra, multipole ranges and 
masks used in the Planck high-£ likelihood. Reduced x 2 $ with 
respect to the best-fit minimal ACDM model are given in the 
fourth column, and the corresponding probability-to-exceed in 
the fifth column. 



Spectrum Multipole range Mask x\cdmI v ^ PTE 

100 x 100 50- 1200 CL49 L01 O40 

143x 143 50-2000 CL31 0.96 0.84 

143 x217 500-2500 CL31 1.04 0.10 

217x217 500-2500 CL31 0.96 0.90 

Combined 50-2500 CL31/49 1.04 0.08 



quency combination are shown in Fig. 11, and compared to spec- 
tra derived from the 70 GHz and 353 GHz Planck maps. 

We use the likelihood to estimate six ACDM cosmolo- 
gical parameters, together with a set of 14 nuisance paramet- 
ers (11 foreground parameters, two relative calibration para- 



meters, and one beam error parameter 7 , described in Sect. 3. 
Tables 5 and 6 summarize these parameters and the associated 
priors 8 . Apart from the beam eigenmode amplitude and calibra- 
tion factors, we adopt uniform priors. To map out the corres- 
ponding posterior distributions we use the methods described 
in Planck Collaboration XVI (2013), and the resulting marginal 
distributions are shown in Fig. 12. Note that on the parameters 
A tSZ , A kSZ and A™ we are using larger prior ranges as compared 
to Planck Collaboration XVI (2013). 

Figure 12 shows the strong constraining power of the Planck 
data, but also highlights some of the deficiencies of a 'Planck 
-alone' analysis. The thermal SZ amplitude provides a good ex- 
ample; the distribution is broad, and the 'best fit' value is ex- 

7 The calibration parameters cioo and c 2 n are relative to the 143 x 
143 GHzcross-spectrum, whose calibration is held fixed. Only the first 
beam error eigenmode of the lOOx 100 GHz cross-spectrum is explored, 
all other eigenmodes being internally marginalised over 

8 We use the approximation 6 M c to the acoustic scale 0* (the ra- 
tio of the comoving size of the horizon at the time of recombination, 
r s , to the angular diameter distance at which we observe the fluctu- 
ations, D A ) which was introduced by Hu & Sugiyama (1996). 6 M c is 
commonly used, e.g., in CosmoMC, to speed up calculations; see also 
Kosowsky et al. (2002) for further details. 
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Table 5. Overview of cosmological parameters used in this analysis, including symbols, the baseline values if fixed for the standard 
ACDM model, and their definition (see text for further details). The top block lists the estimated parameters, with (uniform) prior 
ranges priors given in square brackets. The lower block lists derived parameters. 



Parameter Prior range Baseline Definition 



a) h = Q h h 2 [ 0.005, 0.1 ] ... Bary on density today 

aj c = Q c h 2 [ 0.001, 0.99] ... Cold dark matter density today 

100#mc [0.5 , 10.0 ] ... 100 x approximation to rJD A (used in CosmoMC) 

r ( 0.088 + 0.015 ) ... Thomson scattering optical depth due to reionization 

£l K [-0.3 0.3 ] Curvature parameter today with Q tot = 1 - £l K 

Yp [0.1 , 0.5 ] BBN Fraction of bary onic mass in helium 

n s [0.9 , 1.1 ] ... Scalar spectrum power-law index (& = 0.05 Mpc -1 ) 

ln(10 10 A s ) [2.7 , 4.0 ] ... Log power of the primordial curvature perturbations (k = 0.05 Mpc -1 ) 

Q A ... Dark energy density divided by the critical density today 

Age ... Age of the Universe today (in Gyr) 

Q m ... Matter density (inc. massive neutrinos) today divided by the critical density 

z re • • • Redshift at which Universe is half reionized 

H [20 , 100 ] ... Current expansion rate in km s -1 Mpc -1 



Table 6. Overview of parameters describing astrophysical foreground modeling, instrumental calibration and beam uncertainties, 
including symbols, definitions, and prior ranges (see text for further details). Square brackets denote hard priors, parentheses indicate 
Gaussian priors. The 'Likelihood' column indicates whether a parameter is used by the CamSpec (C) and/or Plik (P) likelihood. 
Note that the beam eigenmode amplitudes require a correlation matrix to fully describe their joint prior, and that all but J3\ are 
marginalized over internally rather than sampled explicitly. 



Parameter . . . Prior range Likelihood Definition 



A PS 

^100 


[0, 360] 


C 


Contribution of Poisson point-source power to D3000 100 f° r Planck (in yt/K 2 ) 




[0,400] 


P 




A PS 
A 143 


[0, 270] 


C 


As for A\l Q but at 143 GHz 




[0,400] 


P 




A PS 
^217 


[0,450] 


c 


As for A\l Q but at 217 GHz 




[0,400] 


p 




rPS 

' 143x217 ' ' ' 


. • [0,1] 


C,P 


Point- source correlation coefficient for Planck between 143 and 217 GHz 




4 CIB 
^143 


[0,50] 


C,P 


Contribution of CIB power to 143 at the Planck CMB frequency for 143 GHz (in fiK 2 ) 


4 CIB 
^217 


[0, 80] 


c 


As for A™ but for 217 GHz 




[0, 120] 


p 




r CIB 

' 143x217 ' ' ' 


. . [0,1] 


C,P 


CIB correlation coefficient between 143 and 217 GHz 




y CIB 


[-2, 2] (0.7 + 0.2) 
. . [-5, +5] 


c 
p 


Spectral index of the CIB angular power spectrum {T>t oc £ yCm ) 




A tsz 


[0, 50] 


C,P 


Contribution of tSZ to £>^ 143 at 143 GHz (in /uK 2 ) 




^kSZ 


[0,50] 


C,P 


Contribution of kSZ to D3000 (in //K 2 ) 




^rtSZxCIB 


. • [0,1] 


C,P 


Correlation coefficient between the CIB and tSZ (see text) 




^Dust 


. . [0,0.001] 


p 


Amplitude of Galactic dust power (in //K 2 ) 




C100 


[0.98,1.02] 
(1.0006 + 0.0004) 


c 


Relative power spectrum calibration for Planck between 100 GHz and 143 GHz 




Cl\l 


[0.95,1.05] 
(0.9966 + 0.0015) 


c 


Relative power spectrum calibration for Planck between 217 GHz and 143 GHz 






. . (0+1) 


c 


Amplitude of the j-th beam eigenmode (j = 1-5) for the i-th cross-spectrum (i = 


1-4) 


V 


. . [-3, +3] 


p 


Amplitude of the calibration eigenmode for the z'-th detector (set) (i = 1-13) 






. . [-3, +3] 


p 


Amplitude of the j-th beam eigenmode (j = 1-5) for the i-th detector(set) (i = 1- 


-13) 



eluded by the ACT and SPT high resolution CMB experiments 
(Reichardt et al. 2012). For the CIB amplitudes, the upper bound 
on e.g., A™ is significantly weaker than the ACT and SPT con- 
straints. To accurately estimate the foreground parameters at the 
< yuK 2 level, we need to supplement the Planck power spec- 
tra with temperature data from ACT and SPT, as described in 
Planck Collaboration XVI (2013). The fiducial model and fore- 
ground parameters used in the CamSpec likelihood are therefore 



derived from a joint Planck+ ACT '+SPT analysis and is not based 
on the parameters listed in Table 8. In the rest of this section, we 
will use the parameters of Table 8 to discuss inter- frequency re- 
siduals. 

Figure 13 shows the foreground residuals and total residuals 
after removing the best-fit foreground model for all spectra (in- 
cluding the 100 x 143 and 100 x 217 spectra, which are not used 
in the CamSpec likelihood). The first point to note here is that 
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Figure 12. Marginal posterior distributions for the six cosmolo- 
gical (top two rows) and eleven nuisance parameters (lower four 
rows) estimated with the CamSpec likelihood. 



Planck has a limited ability to disentangle foregrounds. While 
the Planck data constrain the Poisson point source amplitudes at 
each frequency, as well as the CIB amplitude at 217 GHz (which 
dominates over the Poisson point source amplitude over much of 
the multipole range), they have only marginal sensitivity to the 
tSZ amplitude in the 100 x 100 spectrum, though the thermal SZ 
is strongly degenerate with the Poisson point source amplitude. 
The remaining foreground parameters are highly degenerate. For 
Planck alone, these minor foreground contributions combine to 
absorb inter- frequency residuals. 

Pairs of spectra are compared in Fig. 14, averaged over bands 
of width A£ = 31 below £ < 2000 and wider bands above 2000. 
(The error bars show the diagonals of the covariance matrices 
of these averages, but it is important to note that the points are 
highly correlated even with bin widths as large as these.) This 
comparison shows that each of the spectra used in the CamSpec 
likelihood is consistent with the best-fit theoretical spectrum to 
high accuracy. In fact, each spectrum can be used to form a like- 
lihood, and each gives a reduced^ 2 close to unity (see Table 4). 
Thus, the six parameter ACDM model provides an excellent fit 
to the Planck high-/ 7 power spectra at all frequencies between 
100 and 217 GHz. 



Figure 15 shows our maximum likelihood primary CMB 
spectrum, together with the best-fit theoretical spectrum. The re- 
siduals with respect to the model are shown in the lower panel. 
The error bars are computed from the diagonal components of 
the band-averaged covariance matrix. The binning scheme is the 
same as in Fig. 14. 

Finally, in Fig. 16 we zoom in on this spectrum in four multi- 
pole ranges using finer binning. The correlated fluctuations seen 
in this figure are mask-induced, and perfectly compatible with 
the six parameter ACDM model. Features such as the 'bite' 
missing from the third peak at £ ~ 800 and the oscillatory fea- 
tures in the range 1300 < £ < 1500 are in excellent agreement 
with what we expect from our covariance matrices and from sim- 
ulations; see Appendix A.4 for a few specific examples. 



6. Accuracy assessment of the high-/ 7 likelihoods 

In this section we compare the power spectra and likelihoods 
derived using our two independent methods, and test the likeli- 
hoods using full Planck simulations. 

6.1. Comparison of the Plik and CamSpec likelihoods 

To allow a consistent comparison between the CamSpec and 
Plik likelihoods, we use the same frequency cross-spectra for 
both codes in the following, i.e., we discard the 100 x 143 and 
100 x 217 GHz frequency combinations from the default Plik 
likelihood. To achieve this, we modify the Plik likelihood to 
use the fiducial Gaussian approximation instead of the Kullback 
divergence. On the other hand, while we use the same multipole 
coverage, 100 < £ < 2500, and only one mask (CL39) for all 
cross- spectra for Plik, we still use multipole ranges and masks 
as defined in Table 4 for CamSpec. In addition, we perform one 
Plik analysis with the CL49 mask, which matches the CamSpec 
mask at 100 GHz. 

The left column of Fig. 17 shows the differences between 
the Plik power spectra, adopting the above validation settings, 
and the corresponding best-fit model. The right column shows 
the total spectra decomposed into cosmological and foreground 
components. The residuals agree with those in Fig. 13, and do 
not show any evidence of features, except for some excess power 
in the 217 x 217 GHz spectra at small scales, where foreground 
modelling has the highest impact. At scales £ < 1500, the re- 
siduals are coherent between cross spectra as they are computed 
with the same Galactic mask, and the residuals are dominated by 
cosmic variance. At smaller scales (£ > 1500) the residuals are 
dominated by noise and become uncorrected. 

In Fig. 18 we show the CMB power spectrum recovered 
by Plik estimated by removing the best-fit foreground amp- 
litudes from each cross-spectrum and computing their optimally 
weighted average, and the corresponding difference with respect 
to the best-fit ACDM model. The large scatter at low multipoles 
is expected due to cosmic variance. The residual scatter at higher 
multipoles is at the +20//K 2 level, demonstrating the good fit 
provided by the sum of the ACDM and foreground models. 
These CMB residuals can be compared to the CamSpec inverse- 
covariance weighted CMB residuals shown in Fig. 15, which are 
of the same order of magnitude. Thus, the Planck likelihood fit 
to the ACDM model is robust with respect to the detailed shape 
of the likelihood, as quantified in terms of power spectrum re- 
siduals. 

The ACDM parameter constraints derived from the two like- 
lihoods are shown in Fig. 19, while Fig. 20 shows the fore- 
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Figure 13. Foreground model over the full range of HFI cosmological frequency combinations. The upper panel in each plot shows 
the residual between the measured power spectrum and the 'best-fit' primary CMB power spectrum, i.e., the unresolved foreground 
residual for each frequency combination. The lower panels show the residuals after removing the best-fit foreground model. The 
lines in the upper panels show the various foreground components. Major foreground components are shown by the solid lines, 
colour coded as follows: total foreground spectrum (red); Poisson point sources (orange); CIB (blue); thermal SZ (green). Minor 
foreground components are shown by the dotted lines: kinetic SZ (green); tSZXCIB cross correlation (purple). The 100 x 143 and 
100 x 217 GHz spectra are not used in the CamSpec likelihood. Here we have assumed rfnoxi43 = * anc * r foox2i7 = ^• 



15 




Figure 14. Comparison of pairs of foreground subtracted cross spectra, demonstrating consistency of the residuals with respect to 
the best-fit theoretical model. The red line in each of the upper panels shows the theoretical six parameter ACDM spectrum for 
the best- fit parameters listed in Table 8. The lower panels show the residuals with respect to this spectrum, together with error bars 
computed from the diagonal components of the co variance matrices of the band averages. The points here are band- averaged in bins 
of width A^~ 31. 



ground parameters. For the cosmological parameters, the agree- 
ment between the two likelihoods is excellent: when the CL49 
mask is adopted for the Plik likelihood, which is also used by 
CamSpec at 100 GHz, all parameters agree to 0.2cr in terms 
of maximum posterior values. We also see that the widths of 
the distributions are quite similar, with the Plik ones slightly 
broader than the CamSpec ones. Significantly larger differences 
are seen for the foreground parameters. 

These effects can be understood as follows: we use the 
CamSpec likelihood with Galactic mask CL49 for the 100 x 
100 GHz spectra, to minimize the cosmic variance in the low 



to intermediate multipole range {£ < 1200), taking advantage 
of the low level of Galactic emission in this channel. At higher 
multipoles we use the more conservative Galactic mask CL31 
for both the 143 and 217 GHz channels, at the price of a higher 
variance. However, in the specific case of the ACDM model con- 
sidered here, most of the constraints on cosmological parameters 
come from relatively modest multipoles, £ < 1500, rather than 
from the damping tail. This explains why when we repeat the 
Plik analysis, enlarging the sky area from Galactic mask CL39 
to CL49, we find parameter distributions in good agreement with 
those of CamSpec. However, the detailed choice of masks, as 
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Figure 15. 7b/?: Planck maximum-likelihood (primary) CMB 
spectrum compared with the best-fit six parameter ACDM spec- 
trum. Bottom: Power spectrum residuals with respect to the 
ACDM model. The error bars are computed from the diagonal 
elements of the band-averaged covariance matrix, as given by 
Eq. A.25, including contributions from foreground and beam 
transfer function errors. 



well as multipole range cuts, does affect the foreground para- 
meters. This is investigated further in Sect. 7. 

Figure 20 shows the foreground parameters estimated from 
both Plik and CamSpec. We consider the case for Planck data 
alone, and with the inclusion of data from ACT and SPT. We 
also impose, for CamSpec, a Gaussian prior of 0.7 ± 0.2 on the 
CIB slope parameter, y CIB . We find that the upper bounds on 
the CIB amplitude at 143 GHz, A™, and on the SZ amplitudes 
(both thermal and kinetic, A tSZ and A kSZ ) are in good agreement 
using Planck data alone, but we see differences (~ 1.5<x) in the 
CIB and Poisson amplitudes at 217 GHz (A™, Aff 7 ) as well as 
a difference in the CIB correlation coefficient, r™ x217 . 

We understand this effect in the following way: CamSpec 
uses a more limited multipole range and a more conservative 
mask at 217 GHz than Plik, in order to minimize the Galactic 
emission in this channel. This enhances the degeneracy between 
the foreground parameters at 217 GHz, and enhances the sensit- 
ivity to possible deviations of the CIB power spectrum from the 
pure power law assumed here. This is artificially enhanced by 
normalizing the components at £ = 3000, which is more suit- 
able for high resolution experiments than for Planck. Despite 
the disagreement of the precise decomposition into the different 
physical components, the sum of the foreground contributions 
at 217 GHz in Plik and CamSpec is in good agreement. When 
ACT and SPT data are added, these problems are largely allevi- 
ated, as shown in Fig. 20. 

Are these differences important for cosmology? To address 
this question, we examine the covariance between cosmological 
and foreground parameters. Figure 21 shows the correlation mat- 
rix for all the estimated parameters. The basic ACDM paramet- 
ers have well-known correlations: the scalar spectral index, n s , 
is anti-correlated with both the amplitude A s and the dark mat- 
ter density £2 c /z 2 , which is itself correlated with the amplitude. 
Within the foreground parameters, there are strong correlations 
between the Galactic dust amplitude, the CIB amplitudes, and 
the SZ amplitude, as well as between point source amplitudes at 



different frequencies. These correlations result from the conser- 
vative foreground model adopted here, where all amplitudes of 
the CIB and Poisson contributions are left free to vary in each 
frequency pair; this choice results in small residuals in the fits, 
at the price of partial degeneracies between the foreground para- 
meters. 

Despite this conservative foreground model, there is a small 
correlation between cosmological and foreground parameters, 
the strongest effect being a 34% anti-correlation between the 
kinetic SZ amplitude and the scalar spectral index. The kinetic 
SZ power spectrum amplitude is positive, so marginalizing over 
it affects the peak value of n s despite the fact that A kSZ is not sig- 
nificantly different from zero. The addition of smaller scale data 
from ACT and SPT helps to break this degeneracy. In addition, 
as can be seen in Fig. 20, the posterior distributions of CamSpec 
and Plik for A kSZ are in good agreement, showing the stability 
of the cosmological parameters to the likelihood method. 

In Sect. 7, we will further explore the stability of the cosmo- 
logical and foreground distributions to technical choices made 
in the likelihood and data selections. 



6.2. Comparison to simulations 

It is important to demonstrate the precision and accuracy with 
which cosmological parameters, and foreground parameters to a 
lesser extent, can be recovered on realistically simulated data. 
Here we compare the posterior distributions of cosmological 
and foreground parameters, together with calibration and beam 
error parameters, inferred using the Plik likelihood, with the 
input values of a set of simulations, referred to as 'Full Focal 
Plane' (FFP6). The signals in these simulations are based on the 
'Planck Sky Model' (Delabrouille et al. 2012) which includes a 
detailed model of the astrophysical emission, both Galactic and 
extragalactic, at the Planck frequencies. The simulations also 
reproduce in detail the main instrumental systematic effects of 
Planck, including correlated timeline noise, instrumental point- 
ing, flags, anisotropic detector beams, and spectral bandpasses. 
One thousand CMB and noise realisations were generated using 
the same foreground emission, and a hundred realisations were 
performed at the level of different detector sets. These simula- 
tions are described further in Planck Collaboration ES (2013). 

In order to test for any methodological bias, we estimate 
cosmological parameters from 100 FFP6 simulations using the 
Plik likelihood. They consist of random realisations of CMB 
anisotropies and noise, superimposed with a single realisation 
of a frequency-dependent foreground template. We assume the 
foreground power spectra to be known exactly, and include their 
additional power as a constant component in our model. We do 
not estimate the foreground parameters in this case. This goal of 
this test is to demonstrate the reliability of our analysis pipeline 
and explore the effects of the noise and CMB -foreground chance 
correlations on parameter estimation. 

To compare our results to the simulation inputs, we also es- 
timate cosmological parameters from the 100 CMB realisations, 
without including noise or foregrounds. In the likelihood evalu- 
ations we down- weight the flight part of the power spectrum as 
if there were noise at the level of the noise simulations. A direct 
comparison of the derived parameters from this CMB -only ana- 
lysis and the full simulation (that contains noise and foreground 
emission) allows us to remove the scatter introduced by cosmic 
variance. 

The result of the analysis is shown in Fig. 22, where we plot 
the distribution of the difference of the mean estimated paramet- 
ers, between the CMB -only simulation and the noisy CMB simu- 
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Figure 16. Zoom-in of regions of the Planck primary CMB power spectrum using finer bin widths (A£ = 15 for I < 1000 and 
At - 7 for 1000 < i < 2200. In the upper panels, the red lines show the best-fit ACDM spectrum, and the blue lines join the Planck 
data points. Error bars are computed as in Fig. 15. 



lation with foregrounds, in units of the standard deviation of each 
individual distribution. An unbiased pipeline will give a differ- 
ence consistent with zero. Averaged over 100 simulations, only 
the bias on £l c h 2 is statistically significant, but is < 0.3cr. An 
interesting point to note from these histograms is that the distri- 
butions are rather wide, mostly around 0.5<x, while by construc- 
tion, we only studied the effect of noise and CMB -foreground 
chance correlations in this test. 

The FFP6 simulations can also be used to assess the sensit- 
ivity to foreground modelling errors. In Fig. 23, we compare the 
posterior marginal distributions of cosmological parameters, es- 
timated with different assumptions about the foreground model. 
The blue lines, which correspond to the analysis of CMB -only 
simulations (but accounting for the noise covariance in the like- 
lihood) correspond to the idealised case where the foregrounds 
play no role, and where the noise-induced variance has been av- 
eraged. The red, thick lines show results obtained when margin- 
alising over the parameters of the model of foregrounds that is 
applied to the Planck data (see Sect. 3), with a fixed value of the 
CIB spectral index y cm . Purple and grey lines show respectively 
the effect of leaving y cm free when marginalising, or fixing it to 
the displaced value of 0.4, more than 2<x away from the peak pos- 
terior. The green lines show the effect of leaving y Dust (spectral 
index of the Galactic dust emission) free in the marginalisation. 



The distributions are all in reasonable agreement with the 
input parameters of the simulation. In addition, we see that vary- 
ing assumptions on the parameters of the foreground model (red, 
green, purple, and yellow lines) have negligible impact on the re- 
covered cosmological parameters. Finally, the broadening of the 
posteriors between the CMB -only exploration and the full case 
(including noise random realisations and foregrounds) is expec- 
ted, as the latter includes all the sources of variance, including 
CMB -foreground and CMB -noise chance correlations. 

It is worth noting that the FFP6 foreground simulations, 
based on extrapolations of existing observations, cannot be de- 
scribed by the simple foreground model used in the likelihood 
analyses. The upper bounds on biases introduced by a possible 
mismatch between the simulated foreground templates and the 
model used in the analysis, inferred from Fig. 22, should be rep- 
resentative of the Planck data analysis. The negligible impact of 
the various assumptions made on the foreground model paramet- 
ers of Fig. 23 confirms that the cosmological parameter estima- 
tions are robust to details of the foreground model. 

7. Consistency checks 

In this section we investigate the stability of the distributions of 
cosmological and foreground parameters. The technical choices 



18 



Planck collaboration: CMB power spectra & likelihood 




2500 



2500 



2500 



10 4 
10 3 
10 2 
10 1 
10° 
10 1 

10 4 
10 3 
10 2 
10 1 
10° 
10 1 



■ 100 x 100 


data 
— model 
— - cmb 


cib and sz 

— poisson 

— galactic dust 


— ksz 

- subpixel correction . - 
■ - ■ - cross det correction 
























/ 





2500 





10 4 r 




10 3 r 




10 2 r 




10 1 ; 




10° r 




10 




10 4 r 




10 3 r 


~k 


10 2 r 




10' 




10° r 




10 







500 



1000 



1500 



2000 



2500 



• 143 x 143 














- 








:-' ^ ^ - 















500 



1000 



1500 



2000 



2500 




500 



1000 



1500 



2000 



2500 




2500 



Figure 17. Left: Residuals between the Planck power spectrum derived with Plik using 'validation' settings (described in the text) 
and the best-fit model. The light grey lines show residuals for individual detector pairs within each frequency combination. The 
blue lines show the inverse co variance weighted averages of the individual residuals, together with their errors computed from the 
covariance matrix. Right: Decomposition of the total best-fitting model power spectra into CMB, combined thermal SZ and CIB, 
unresolved point sources, kinetic SZ, and Galactic dust. 



made in constructing the high-^ likelihood fall into three broad 
categories. The first category covers internal parameter choices 
that leave the data selection unchanged. This includes choices 
such as the binning strategy, marginalizing or not over calibra- 
tion and beam errors, and the description of the noise model. 
The second category includes variations in the data selection, 
such as the multipole range used, and the choice of masks. The 
final category accounts for variation in the foreground model. 
We perform a suite of tests to investigate the impact of these 
choices on parameters. We use the Plik likelihood, and all tests 
are compared to the baseline Plik spectra. Most of the res- 
ults can be summarized by 'whisker plots' which compare the 
main properties of the posterior distribution of the cosmological 



and foreground parameters. More detailed results are reported in 
Appendix D.l. 

In this section we also compare our estimated cosmological 
parameters to those derived from spectra computed from the LFI 
70 GHz channel. We additionally check the consistency of para- 
meters with results obtained using the power spectrum of CMB 
maps derived by component separation methods (described in 
Planck Collaboration XII 2013) that use Planck data at all fre- 
quencies. This battery of tests demonstrates the stability of the 
inferred cosmological parameters. 

A final test is to compare the predicted polarisation spec- 
trum of the best fitting ACDM model with spectra measured 
from Planck. As discussed in Planck Collaboration I (2013) and 
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Figure 18. Top: CMB power spectrum estimated using the 
Plik likelihood, after subtracting the best-fit foreground model. 
Bottom: Residual difference (blue line) between the Plik spec- 
trum and the best-fit ACDM model. The solid green line shows 
the difference between the best-fit models derived from the Plik 
and CamSpec likelihoods. 



Planck Collaboration VI (2013), the Planck polarisation data is 
not yet used in our cosmological analysis, as further tests must 
be performed, but the current results increase our confidence in 
the robustness of the high-£ temperature likelihood. 

7.1. Impact of technical choices for fixed data selection 

Here we consider three changes: (1) fixing the inter- frequency 
calibration and beam errors to the best-fit values, rather than 
marginalizing, (2) including sub-pixel effects, and (3) including 
noise correlation between detectors. Figure 24 shows the corres- 
ponding impact on parameters. 

The effect of fixing the calibration and beam errors is negli- 
gible on most cosmological parameters within the six parameter 




Figure 19. Comparison of cosmological parameters estimated 
from the CamSpec (blue) and Plik (red for mask C48; purple 
for mask C58) likelihoods. All parameters agree to better than 
0.2cr when C58 is used for Plik, matching the sky area used by 
CamSpec at 100 GHz. 
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Figure 20. Comparison of foreground parameters estimated with 
the CamSpec and Plik likelihoods. The red (purple) lines show 
the CamSpec (Plik) distributions using only Planck data, and 
the green (blue) lines show the CamSpec (Plik) results when 
additionally including ACT and SPT. 



ACDM model, with the exception of n s where we see a 0.1 6<x 
shift. There is a bigger effect on the foreground parameters due 
to their partial degeneracy, and their subdominant contribution 
to the total power. There is only a small correlation between 
the cosmological parameters and the calibration coefficients, so 
marginalizing or fixing their value has little impact on the cos- 
mology. 

The calibration coefficients are, however, strongly correl- 
ated with each other, in particular at 217 GHz, since they are 
also significantly correlated with e.g., the CIB amplitudes in the 
217 x 217 and 217 x 143 GHz spectra. This is important to keep 
in mind when comparing the calibration estimates obtained here 
with those obtained from the CMB dipole in the HFI data pro- 
cessing paper (Planck Collaboration VI 2013). Nevertheless, as 
can be seen in Fig. 26, the peak posterior values of the relative 
calibration coefficients are found to differ from 1 at most at the 
few parts per thousand for all the 13 detectors sets involved, in 
agreement with the estimates of the calibration accuracy of the 
maps (Planck Collaboration VI 2013), although a wide flat prior 
has been applied on these coefficients. The same test applied on 
simulations with no beam or calibration errors shows how well 
this test is passed. This confirms that the deviations found at the 
0.1% are significantly detected, and it is important to show that 
these deviations have little impact on the cosmology. 

The estimated values of the beam errors do not imply that 
extra beam corrections are required, with the possible exception 
of the 100 - dsl and 143 -7 detectors (although the correspond- 
ing marginal distribution is far from Gaussian). A comparison of 
the prior and posterior distributions suggests that we have con- 
servatively estimated the uncertainties from the beam determin- 
ation. We keep this conservative approach in CamSpec, in which 
we marginalise analytically over all beam eigenmodes except for 
the dominant 100 GHz mode, which we sample directly. 

As an extended test, we investigate the effect of possible er- 
rors in the beam transfer function when the helium abundance 7p 
is also allowed to vary freely (i.e., without imposing constraints 
from Big Bang Nucleosynthesis), as it has a larger effect on the 
small scale spectrum. Varying this parameter leads to a substan- 
tial broadening of the posterior distributions for fl^h 2 , #mc> and 
n s , as can be seen in Fig. 27. We confirm that marginalizing over 
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Figure 21. Correlation matrix between all the cosmological (top block), foreground (middle block), and derived (bottom block) 
parameters, estimated using the Plik likelihood. 



calibration and beam errors has a small impact on all cosmolo- 
gical parameters, including Fp. On the other hand, it has a some- 
what larger impact on some of the foreground parameters (see 
Appendix D.l). 

We then investigate the impact of two subdominant effects: 
the "sub-pixel effect" and the possible presence of a correlation 
in the noise between detectors or detector sets. The sub-pixel 
effect has a convolutive effect on the power spectra that is sim- 
ilar to gravitational lensing of the CMB, but is purely a result 
of the Planck scanning strategy and the map-making proced- 
ure (Planck Collaboration VII 2013). The scanning strategy on 
rings with very low nutation levels results in the centroid of the 
samples being slightly shifted from the pixel centres; however, 
the map-making algorithm assigns the mean value of samples 
in the pixel to the position at the centre of the pixel. This has 
a non-diagonal effect on the power spectra, but the correction 
can be computed given the estimated power spectra for a given 



data selection, and recast into an additive, fixed component of 
the model co variance matrix. 



The possible noise correlation between detectors may appear 
due to factors such as common residual thermal fluctuations, 
electronic chain noise, or cosmic ray showers. To build a model 
of this correlated component, we compute the cross-power spec- 
tra between detectors of difference maps that are free of sig- 
nal. This procedure should capture all correlations on times- 
cales shorter than half a ring's observation. These estimates are 
noisy, so we compute an average amplitude of the correlation for 
€ > 1000. 



We find that the impact on parameters of both these effects 
is negligible, with less then a 0.1 cr effect. 
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Figure 22. Distribution of the difference between parameter 
mean (as estimated by Plik assuming foreground spectra are 
known) and the input values in units of the posterior standard 
deviation. This demonstrates the absence of methodological bias 
at the level of the intrinsic dispersion between realizations. 
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7.2. Impact of data selection 

Here we consider three changes: (1) varying the angular range 
used in the likelihood, (2) varying the Galactic mask, and (3) 
discarding individual frequency channels. These are expected to 
result in changes in the parameter distributions due to the fact 
that we are changing the input data. 

We first vary the maximum and minimum multipole. Using 
^max = 1008 gives parameter distributions of similar width to 
those obtained by WMAP. We find that all basic ACDM cos- 
mological parameters have converged by £ max ^ 1500, since 
no parameters are specifically sensitive to the damping tail. The 
convergence of the posteriors on foreground parameters is, as 
expected, slower when increasing £ max , as most of them are 
dominant at small scales. The Galactic dust normalization de- 
creases with £ max due to its correlation with the CIB components. 
Similarly, changing from 100 to 50 has a negligible effect 
on cosmological parameters, and mostly affects the determina- 
tion of the Galactic dust amplitude, which decreases for £ m { n as 
it is better measured on large scales; its correlations with the CIB 
clustered and Poisson contributions explain the slight variations 
in the corresponding parameters A™, y CIB , and A^ 7 . 

We then investigate the impact of varying the Galactic mask, 
from the most conservative (CL20) to the least conservative 
(CL49), for fixed multipole range 100 < I < 2508. As expected, 
the errors decrease as the sky fraction increases. From CL20 to 
CL39, cosmological and foreground parameters are stable. The 
foreground parameters change significantly however when we 
use CL49 for all channels, showing that our foreground model 
is unable to properly fit the data: the clearest sign of this failure 
is the unphysically low value of the CIB spectral index (y CIB ), 
indicating that our CIB component determination is getting con- 
taminated by a dust-like component with a steeper angular power 
spectrum than the CIB, but shallower than that of our (single) 
Galactic dust component. At low Galactic latitudes, the pres- 
ence of compact Galactic sources leads to a flatter angular power 
spectrum than that of (high-latitude) diffuse thermal dust, and 
this likely effects the CIB determination. This justifies the choice 
made in CamSpec to use a conservative masking strategy (CL31) 
for the 143 and 217 GHz channels. 

Next, we remove one frequency channel at a time. Results 
change by less than 0.5<x except when removing the 211 GHz 
channel. This removes a large part of the information and 
amounts to retaining only 21 cross-spectra out of the 78. 



7.3. Testing the Foreground Model 

In this section we describe various tests of the foreground model 
used in the likelihood, checking the validity of our model for the 
extragalactic sources. Further tests are also reported in Appendix 
B of Planck Collaboration XVI (20 1 3). 



Figure 23. Posterior marginal distributions of cosmological 
parameters obtained by Plik on a single FFP6 realisation, under 
different assumptions on the foreground model. Red thick lines 
correspond to the foreground model parametrisation described 
in Sect. 3 with fixed y CIB . Purple lines (resp. green, grey) show 
the effect of leaving y cm free (resp. letting y Dust free, or fixing 
y CIB to 0.4, 2<r away from the peak posterior value). Finally, the 
blue lines correspond to the analysis made on CMB -only simu- 
lations. The input values of the parameters of the simulation are 
shown as dashed vertical lines. 



7.3.1 . Poisson power from extragalactic sources 

Here we check that the Poisson power estimated in the like- 
lihood, which comes from sources below Planck's detection 
threshold, is consistent with the level expected given number 
counts of detected galaxies. Figure 28 shows source counts 
from Planck (Planck Collaboration Int. VII 2013) at 100, 143, 
and 217 GHz derived from the Planck Early Release Compact 
Source Catalogue (Planck Collaboration et al. 2011b,a). At 143 
and 217 GHz we also show the source counts from SPT as repor- 
ted in Vieira et al. (2010) at 150 and 220 GHz, and from ACT 
(Marriage et al. 201 1) at 150 GHz. The models of de Zotti et al. 
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Figure 24. Comparison of the distributions of cosmological parameters in the reference case (left) with a set of validation test cases. 
The red line indicates the median and blue the mean, computed from the posterior histograms. The box shows the 68% confidence 
interval; the outer line the 95% interval. 



(2005) and Tucci et al. (2011) are also shown and are discussed 
in Planck Collaboration Int. VII (2013). 

Planck Collaboration Int. VII (2013) use spectral informa- 
tion to separate the sources into 'synchrotron' and 'dusty' 
sources, and show that the counts at 100 - 217 GHz are domin- 
ated by synchrotron sources at flux densities above ~ 400 mJy. 
Vieira et al. (2010) performed a similar separation. The counts 
at 150 GHz are dominated by synchrotron sources at flux densit- 
ies S > 10 mJy, but dusty galaxies contribute roughly equally 
at 220 GHz at flux densities < 30mJy (Vieira et al. 2010; 
Hall et al. 2010). The ACT counts have not been separated ac- 
cording to spectral type, but should be dominated by radio 
sources at these flux densities. 

Figure 28 show models fit to the counts using the function 



7 5/2 



dN 



AS 5 ' 2 



ds (s/Soy + (s/s y 



+ 5 1- exp 



(20) 



where A, So, a, p, B, and S\ are free parameters. The best-fit 
values of these parameters are given in Table 7. 

Given this model, and given the approximate flux cut applied 
to the Planck maps, the expected contribution of radio sources 
to the Planck power spectra, at flux densities smaller than 400, 



Table 7. Parameters for point source model of Eq. 20, fitting 
detected source counts shown in Fig. 28. 



Parameter 


100 GHz 


143 GHz 


217 GHz 


A 


18.24 


8.38 


8.58 


So 


1.58 


1.65 


1.48 


a 


1.88 


1.89 


1.90 


P 


3.35 


3.78 


4.10 


B 


14.91 


8.73 


8.53 


Si 


. . . 14.91 


5.17 


1.78 



350, and 225 ± 50 mJy at 100, 143, and 217 GHz, are 8.47 ± 1, 
6.05 ± 0.8, and 3.10 ± 0.7 Jy 2 /sr. The contribution of unresolved 
infrared galaxies to the power spectra is not negligible. They are 
expected to dominate at 217 GHz, even if they are subdominant 
in the Planck counts. Indeed, faint IR galaxies create a "bump" 
in the S 5/2 dN/dS distribution, below the detection limit of ACT 
or SPT. This bump is seen at higher frequencies, e.g., with the 
Herschel SPIRE instrument (see Planck Collaboration Int. VII 
2013, for details). 

This bump of infrared galaxies has not been measured at fre- 
quencies of 217 GHz and below. However, measurements with 



23 



Planck collaboration: CMB power spectra & likelihood 



m 

^ 20 
10 
|| 

1.0 
0.8 



350 
300 
250 
wg 200 
150 



350 
300 
250 
200 
150 
100 
50 




T T 



All 



T HH 



T T T 



111111 



T T T T T T 



1: 



HH T t 











- i 

ft ^ { 


j * * * * $ 








Km 



200 
. 150 



0.00012 
0.00010 
I 0.00008 
<3 0.00006 
0.00004 
0.00002 
0.00000 





1 

^ B 


, i 






nr 1 




1 














: V 




: t\h : 














^ ^ ^ =i= ^ ^ ^ — 







Figure 25. Comparison of the distributions of foreground model parameters, as in Figure 24. 



the AzTEC telescope at 1.1mm (270 GHz, Scott et al. 2012) 
can be used to extrapolate the counts down to 217 GHz. This 
leads to a predicted peak in the number counts (S 5/2 dN/dS) 
around 1.4 mJy at a level of 190 Jy 15 /sr, somewhat higher 
than the values in Hall et al. (2010). The corresponding contri- 
bution of infrared galaxies to the power spectrum is estimated in 
Planck Collaboration XVIII (2011) to be 16 Jy 2 /sr at 217 GHz 
but with significant uncertainty. 

Summing the expected contributions from radio and IR 
galaxies, we estimate the following values for D3000 for Planck: 
200, 75, and 120 fiK 2 at 100, 143, and 217 GHz respectively. 
These predictions are much less certain at 217 GHz due to the 
absence of infrared galaxy counts at this frequency. 



7.3.2. Clustered power in CIB fluctuations 

Here we check the consistency of the estimated clustered power 
in CIB fluctuations. As already noted, from the CamSpec and 
Plik likelihoods we find only an upper limit on the clustered 
CIB power at 143 GHz. With CamSpec we detect clustered 
power at 217 GHz with A™ = 32+10 yuK 2 , and Poisson 
power with A^ 7 = 92 + 22 yuK 2 . This Poisson power is dom- 
inated by the CIB fluctuations. For the Plik likelihood, we have 



A™ =49 + 7 yuK 2 and A*f 7 = 58 + 19 yuK 2 . The sum of the CIB 
power at 217 GHz, and at pivot scale i - 3000, is in the range 
« 105 - 125 yuK 2 . 9 

We compare this level to the measurements by the ACT 
and SPT experiments, which probe higher angular resolution. 
Fitting a common model to the ACT power spectra from 
Das et al. (2013), and the SPT spectra from Keisler et al. (201 1); 
Reichardt et al. (2012), the analysis in Dunkley et al. (2013) 
54+16 yuK 2 for the CIB clustered component 



finds A™ 6 



(and A 



PS,CIB 
219.6 



78 + 12 yuK 2 for the CIB Poisson component) at 



effective frequency 219.6 GHz. For SPT, the clustered level is 
A™ 6 = 59+12 yuK 2 (and Poisson A^ IB =69 + 10 yuK 2 ), also 
at an effective frequency of 219.6 GHz. This is consistent with 
the SPT analysis in Reichardt et al. (2012). These are estimated 
assuming y cm =0.8 and r™ x2 n ~ 1» anc * that the CIB emis- 
sion can be modeled with frequency as a modified blackbody, 
following Addison et al. (2012b). 

The total CIB signal seen by Planck, extrapolated to I - 
3000 scales, is therefore consistent with the ACT and SPT ob- 



9 Note that Planck does not measure directly the CIB power at the 
pivot scale £ = 3000, hence these extrapolated values are sensitive to 
possible shape mismatch of the clustered CIB fluctuation power spectra 
at lower multipoles. 
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Figure 26. Estimated calibration and beam eigenmode parameters, compared to the priors, using (left) 100 to 217 GHz data, and 
(right) a single 'FFP6' simulation where there was no calibration or beam errors. The shaded area shows the width of the prior 
imposed on the first beam eigenvalue. 
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Figure 27. Posterior distributions for cosmological parameters, 
varying the Helium abundance Yp (red). We show the effect of 
marginalizing over calibration and beam errors (blue), and the 
results obtained using the reference settings (purple), where Y P 
is constrained by BBN. 



servations, but given the limited angular range of Planck, the 
clustered and Poisson part are degenerate. This motivates us to 
include the ACT and SPT data in many of our cosmological ana- 
lyses. 

When combining Planck, ACT and SPT data together, using 
the same foreground model (except for Poisson power which 
depend on the respective flux cuts of the experiments), both 
CamSpecandPlikgiveA™ = 50 + 5 yuK 2 ,A^ 7 = 60+10 yuK 2 , 
A™ = 32 + 8 yuK 2 , A™ 3 = 75 + 8 yuK 2 , andA^ = 220 + 53 yuK 2 . 
These estimates of the Poisson power are in good agreement 



with the predictions given in Sect. 7.3.1 for the 100 and 143 GHz 
channels. In the latter, radio sources below the Planck flux cuts 
dominate the Poisson power, which can be reliably estimated 
from existing source counts measurements. 

We also consider modifying our model for the clustered part 
of the CIB. There have been a wealth of CIB models (e.g., 
Knoxetal. 2001; Amblard & Cooray 2007; Halletal. 2010; 
Penin et al. 2012; Addison et al. 2012a), most assuming that the 
dust is a biased tracer of the dark matter distribution, but differ- 
ing in their parametrization of the dust emissivity and its evol- 
ution, and their treatment of the dark matter power spectrum. 
Recent papers (Planck Collaboration et al. 201 lc; Addison et al. 
2012a) have shown that the addition of Planck CIB measure- 
ments, when combined with other small scale probes including 
SPT, ACT, BLAST, and Herschel, rule out models that assume 
the underlying dark matter power spectrum is linear. 

We therefore test a set of models that try to simultaneously fit 
the non-linear spectrum with one or more template spectra. We 
consider fixing the scale dependence to a power law, either £°- 8 or 
£° - 6 , or extending the power law model to have a running of the 
index. The £°- 8 model has been used in Shirokoff et al. (2010); 
Reichardt et al. (2012); Addison et al. (2012b); Dunkley et al. 
(2013), while the £ - 6 more closely matches the CIB model of 
Addison et al. (2012a). We find that a simple power law does 
not fit both the Planck and high-/ 7 data sufficiently well, but that 
allowing the additional freedom of a running spectrum opens up 
the parameter space too much, with little improvement in good- 
ness of fit, motivating our use of the varying y cm model. 

We also test a simple model using just a linear theory dark 
matter power spectrum, assuming that the non-linear power can 
be absorbed into the Poisson term. This results in an estimate of 
the Poisson level at 217 GHz that is inconsistent with ACT and 
SPT, so we do not use this model. For all these models we test the 
effect on cosmological parameters, using the Hubble constant as 
a test case, and find the effect on parameters to be small. This is 
also investigated in Planck Collaboration XVI (2013). 

7.4. Consistency of the Planck 70 GHz data 

Figure 1 1 shows individual frequency spectra from 70 to 
353 GHz. We only use data from the 100 to 217 GHz channels 
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Figure 28. Number counts from Planck (filled circles, Planck Collaboration Int. VII 2013), ACT, and SPT (open squares) as 
described in the text, from 100 GHz to 217 GHz. The models from de Zotti et al. (2005, solid line) and Tucci et al. (2011, dots) are 
overplotted. The analytical fit from Eq. 20 and Table 7 is shown dashed, and shows a similar behaviour to the de Zotti et al. (2005) 
model. 



to form the high-^ likelihoods, but here we compare cosmolo- 
gical parameters derived from the Planck 70 GHz channel alone. 
Maps at 70 GHz are easier to characterize than the higher fre- 
quency channels in terms of instrumental and foreground prop- 
erties, but the resolution and sensitivity are lower. The 70 GHz 
noise properties are in general well described by a simple three 
parameter model involving 1/f and white noise contributions 
(Planck Collaboration II 2013). The 70 GHz channel also has 
the least diffuse foreground emission (Planck Collaboration XII 
2013), and the extragalactic source contribution is dominated by 
radio galaxies whose emission is well known at these frequen- 
cies (Planck Collaboration XXVIII 2013). We adopt a Galactic 
plane cut leaving ~ 70% of the sky for the analysis (CS70) 
to which we add a point source mask optimized for 70 GHz. 
In Appendix D. 2 we describe how cosmological parameters are 
estimated from the 70 GHz channel, which are summarised in 
Fig. 24. Accounting for the lower sensitivity and angular resolu- 
tion at 70 GHz, which translate into a narrower multipole range 
(i < 1200), the parameter distributions are consistent with the 
reference values. 

7.5. Consistency with power spectra of CMB maps obtained 
by component separation methods 

The likelihoods we consider in this paper account for com- 
ponent separation by modeling the multi-frequency data at the 
power spectrum level, to fully exploit the signal at the smal- 
lest scales probed by Planck. We can compare the results to 
those derived from an alternative approach, measuring the power 
spectrum of CMB maps estimated from component separation 
techniques. Here we present results obtained with four CMB 
maps, derived using methods referred to as Commander-Ruler, 
SMICA, NILC, and SEVEM, described in detailed in the ac- 
companying paper (Planck Collaboration XII 2013). We com- 
pare their angular power spectra and cosmological parameters 
with those from CamSpec. To estimate the power spectra we 
use the XFaster method, an approximation to the iterative, max- 
imum likelihood, quadratic band power estimator based on a di- 
agonal approximation to the quadratic Fisher matrix estimator 
(Rocha et al. 2009, 2010). The noise bias is estimated using dif- 
ference maps, as described in Planck Collaboration XII (2013). 
The resulting spectra are shown in Fig. 29, and agree well out to 
scales £ max = 2000, even though the agreement is less striking 
for the Commander-Ruler small-scale spectrum. 

We then estimate cosmological parameters using a Gaussian 
correlated likelihood derived from these band-powers. To model 
the residual extragalactic foregrounds, we consider two nuisance 
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Figure 29. Top: CMB power spectrum, with best-fitting fore- 
ground model removed, compared to the CamSpec best fit model. 
Middle: the residuals with respect to this best-fit model. Bottom: 
residuals of the best-fit models from the map-based likelihoods, 
with respect to the CamSpec best fit model. 

parameters: A ps , the amplitude of a Poisson component, and Aq, 
the amplitude of a clustered component, scaling a term with 
shape Di oc £ - 8 . Figure 30 compares the parameters obtained as 
a function of £ max for each method, compared to the values from 
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the analysis, we applied CL39 to the temperature maps, and dis- 
carded 60 % of the polarisation data (i.e., / S k y = 0.4). Other than 
masking, no efforts have been made to subtract foreground con- 
tributions or take into account instrumental effects such as leak- 
age. Despite the substantial masking applied, we see evidence 
for residual contributions of non-cosmological origin. Besides 
demonstrating the potential of Planck to deliver high quality po- 
larisation maps and spectra (with the limitations explained in 
Planck Collaboration VI 2013), the figure demonstrates the high 
level of consistency of these polarisation spectra between them- 
selves, and with the prediction from the model fit using just the 
temperature spectrum shown in Fig. 15. 

As discussed in Planck Collaboration VI (2013) and 
Planck Collaboration II (2013), at present, the HFI and LFI po- 
larisation spectra at low multipoles are affected by systematic er- 
rors that cause biases which will need to be accurately modeled 
or removed for the next Planck release. However, these system- 
atics rapidly become unimportant at higher multipoles. While 
not yet fit for cosmological parameter analysis, the consistency 
at the level of a few //K of these Planck polarisation spectra adds 
to our confidence in the analysis of temperature data. It shows 
that within the ACDM framework, the cosmological parameters 
estimated from Planck temperature data are not strongly affected 
by the uncertainties in the modelling of unresolved foregrounds. 



8. Low-/ likelihood 

At low multipoles (£ < 50), the distribution of the estim- 
ated Qs is not well approximated by a Gaussian due to 
the limited degrees of freedom per £ (e.g., Efstathiou 2004). 
However, both the CMB signal, s, and instrumental noise, n, 
are individually nearly Gaussian distributed at the map level, 
provided that foreground emission and instrumental systemat- 
ics effects are negligible (e.g., Planck Collaboration XXIII 2013; 
Planck Collaboration XXIV 2013), and the actually observed 
map, m = s + n, is therefore also nearly Gaussian distributed. 
Under this assumption, the CMB power spectrum likelihood is 
given by 



Figure 30. The variation of cosmological and foreground para- 
meters estimated for the four CMB maps as function of £ max = 
1000, 1500, 2000, 2500, compared to those obtained with the 
CamSpec and Plik likelihoods. Parameters are consistent to lcr 
for Amx ^ 2000 



the high-^ likelihoods. The results are consistent to l<x or better 
for £ max = 1500 and £ max = 2000. Despite adopting a simple 
two-parameter model for the extragalactic foregrounds, the like- 
lihood using a CMB map as input data works reasonably well, 
and may be further exploited with analysis of simulations, im- 
proved extragalactic foreground modeling, and the development 
of an error model. 



7.6. Consistency with high-£ polarisation 

Our final consistency test is illustrated in Fig. 31, showing the 
polarisation power spectra derived from the Planck data. Both 
the TE and EE cross-spectra are shown, in bins of width of 
M - 40. These spectra are computed by performing a uniformly 
weighted average of all detector sets combinations at 70 x 100, 
100 x 143, 100 x 217, and 143 x 217 GHz. We use the tem- 
perature beam window functions for beam deconvolution. For 



£(Q) = P(m\Q) = 



J- — — exp ( --m ; M"'m| , (21) 

2 ^/2| M |l/2 ^ 2 I 



where n is the number of observed pixels, M(Q) = C(Q) + N is 
the data covariance matrix, and C and N are the CMB and noise 
co variance matrices, respectively. 

In the general case, the data vector 
temperature (T) and linear polarisation 
parameter maps. Pixels exhibiting high 
tamination are removed by masking, 
data vector is restricted to the subset 
m = (T h ,T h , T nT , Qj 19 Q h , ...Q np , U h , 
corresponding rows and columns are removed from M, effect- 
ively corresponding to marginalizing over the masked region of 
the sky. Note that in general, n T ± n P , and the sets of indexes 
of temperature and polarisation measurements will be different. 
We assume the same number of pixels in Q and U, although this 
is not a requirement. 

The signal covariance matrix can be written symbolically as 



m includes both 
(2, U) Stokes 
foreground con- 
such that the 
of valid pixels, 
M h ,...U np ). The 



C = 



(TT)( nTXnT ) (TQ)( nTXnp ^ {TU)( npXnp ^ 
(QT) (QQ)(n P xn P ) (QU) 

(tlpXtlp) 

JJJT)( np xn T ) (^Q)(n P xn P ) (UU)( npXnp ^ 



(22) 
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where the signal correlations for the temperature component are 
explicitly given by 

^ max 2^ + 1 

(T h T h ) = V ——QP^h) + N flfe . (23) 

Here P^ are the Legendre polynomials, and 6 ili2 is the 
angle between the centres of pixels i\ and h- Similar ex- 
pressions are available for the polarisation correlations (e.g., 
Tegmark & de Oliveira-Costa 2001). The effect of the (azi- 
muthally symmetric) instrumental beam, b^, and pixel window 
function, we, are encoded in Q = Cfb^w 2 . 

The main problem with the likelihood expression given in 
Eq. 21 is its high computational cost. This is determined by 
the matrix inversion and determinant evaluations, both of which 
scale as 0(N 3 ) with N = n T + 2n P . In practice, this approach is 
therefore limited to coarse pixelizations, N^q < 16, which reli- 
ably only supports multipoles below £ < 30. On the other hand, 
the Gaussian approximation adopted by the high-^ likelihood is 
not sufficiently accurate for the stringent requirements of Planck 
below £ < 50. In the next section, we therefore describe a faster 
low-/ 7 likelihood estimator, based on Gibbs/MCMC sampling, 
which allows us to exploit the full range up to £ < 50 with 
low computational cost, while additionally supporting physic- 
ally motivated foreground marginalization. 

Page et al. (2007) pointed out that the temperature and po- 
larisation parts of the likelihood can be separated and evaluated 
independently, under the assumption of negligible noise in tem- 
perature and in the temperature-polarisation cross correlations 
(i.e., the TQ and TU blocks of the pixel level noise co variance 
matrices). Further assuming vanishing primordial B modes and 
TB correlations, the TE correlations can be accounted for by 
redefining the modified Q and U maps as 

£max qTE I 

Q^Q-iTtTZfTi'Z* ( +2Ym + - 2 (24) 

£=2 W m=-t 
. £max qTE I 

U ^ U -^Tj7^Tj a 'm ( + 2**» ~-2 ISJ , (25) 

Z £=2 W m =-£ 



where viYi m are spin weighted spherical harmonics and a T tm are 
the harmonic coefficients of the signal in the temperature map. 
One can show by direct substitution that these modified Q and U 
maps are free of temperature correlations. The polarisation like- 
lihood can be then computed independently from the temperat- 
ure likelihood and, possibly, at lower resolution to save compu- 
tational expenses. We test this strategy in Sect. 8.2, and adopt it 
for the current release of the Planck likelihood. 

8.1. Low-£ temperature likelihood 

As discussed above, we do not implement the likelihood ex- 
pression given in Eq. 21 directly, due to its high computational 
cost and limited flexibility with respect to foreground modelling. 
Instead, we adopt the Gibbs sampling approach (Eriksen et al. 
2004; Jewell et al. 2004; Wandelt et al. 2004), as implemented 
by the Commander code (Eriksen et al. 2008), which allows 
both for physically motivated component separation and accur- 
ate likelihood estimation. A similar Gibbs sampling method was 
used to estimate the low-£ temperature likelihood for WMAP 
(Dunkley et al. 2009; Larson et al. 201 1), although not simultan- 
eously accounting for component separation. 

8.1.1. Methodology 

We start by generalizing the above data model to include both 
multi-frequency observations and a set of foreground signal 
terms, 

d y = s + ^f y + n y . (26) 

i 

Here d y denotes the observed sky map at frequency v, and f y 
denotes a specific foreground signal component. As above, the 
CMB field is assumed to be a Gaussian random field with power 
spectrum Q, and the noise is assumed Gaussian with covari- 
ance N v . The foreground model can be adjusted as needed for 
a given data set, and a full description of the model relevant for 
Planck is presented in Planck Collaboration XII (2013). In short, 
this consists of a single low-frequency foreground component 
(i.e., the sum of synchrotron, anomalous microwave emission, 
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and free-free emission), a carbon monoxide (CO) component, 
and a thermal dust component, in addition to unknown mono- 
pole and dipole components at each frequency. 

Given this data model, we map out the full posterior 
distribution, P(s,f,Q|d), using a Monte Carlo sampling al- 
gorithm called Gibbs sampling. Directly drawing samples from 
P(s,f,Q|d) is computationally prohibitive, but this algorithm 
achieves the same by iteratively sampling from each correspond- 
ing conditional distribution, 

s <- P(s|f , Q, d) 
f <-P(f|s,Q,d) 
Q<-P(Q|s,f,d). 

It is straightforward to show that P(s|f , Q, d) is a multivariate 
Gaussian distribution, and P(Q|s,f,d) is an inverse Gamma 
distribution. The foreground distribution, P(f |s, Q, d), does not 
have a closed analytic form, but can easily be mapped out nu- 
merically (Eriksen et al. 2008). Thus, all three distributions are 
associated with simple textbook sampling algorithms. 

For CMB likelihood estimation, the crucial intermediate 
product from the above sampling process is the ensemble of 
CMB sky samples, s k . Each individual sample corresponds to 
one possible CMB realization consistent with the observed data. 
In the absence of sky cuts, foreground contamination and instru- 
mental noise, this map is identical to the true sky. In that case, the 
likelihood as a function of Q is determined by cosmic variance 
alone, and given by an inverse gamma distribution, 

if- 1 

2 

r °~ f k 21+1 °-(Jc 

£ k (Q) oc -^e 2 * . (27) 
C 2 

Here we have introduced the realization specific power spec- 
trum, aw = Ylt=- m \ a £ m \ 2 > wri ere a k m are the spherical har- 
monic coefficients of s k . In the case of realistic data, we need to 
marginalize over uncertainties due to sky cuts, foregrounds, and 
instrumental noise. Hence, Eq. 27 is replaced by an average over 
all possible sampled CMB realizations, 

-Vsamp 

JXd) OC J] ( 28 ) 
k=l 

This expression is known as the Blackwell-Rao estimator 
(Chu et al. 2005), and is guaranteed to converge to the exact 
likelihood as the number of samples, Af samp , increases. Note that 
the normalization factor in this expression is unknown, but since 
the likelihood function is only used to compare different models 
through an effective likelihood ratio, this factor is irrelevant for 
actual calculations. 

8.1.2. Data selection and preprocessing 

As described in Planck Collaboration XII (2013), we include 
Planck frequencies between 30 and 353 GHz in the low-£ like- 
lihood. Each frequency map is downgraded from its native res- 
olution to a common resolution of 40', and projected onto an 
A^side = 256 HEALPix grid. Uncorrected Gaussian regulariza- 
tion noise is added to each frequency channel map, with an RMS 
proportional to the spatial mean of the instrumental noise of the 
corresponding channel, (cr y ), conserving relative signal-to-noise 
between channels. The regularization noise level at frequency 
y is 5//K • (o-y) / (<xi43GHz)- The purpose of this is to make the 
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Figure 32. Two-parameter probability distribution for an 
amplitude-tilt model constrained by the low-£ Planck likelihood 
using three different masks. Angular scales between 2 < £ < 49 
are included in this fit. 



results insensitive to unmodelled features at scales comparable 
to and beyond the smoothing scale of 40', in addition to im- 
prove the convergence speed of the Gibbs sampler. The resulting 
signal-to-noise is unity at £ ~ 400, and the additional uncertainty 
due to the regularization noise is less than 0.2 yuK 2 below £ = 50, 
and less than 1 //K 2 below £ = 100. 

To study the stability of the low-/ 7 likelihood with respect to 
sky fraction, we constructed a suite of five different masks, cov- 
ering between 81 and 100% of the sky, and for completeness 
we include the WMAP KQ85 mask, covering 75% of the sky, 
as a sixth case. These low-£ masks are distinct from those em- 
ployed for the higfi-^ likelihood, and are produced in a prelim- 
inary Commander full-sky analysis in which we estimate indi- 
vidual foreground components and residual x 2 values per pixel. 
These maps are thresholded at various levels to produce a useful 
range of sky fractions. 

For each mask, we fit a two-parameter amplitude and tilt 
power spectrum model of the form C^(q,n) = qC^f (£/£o) n , 
using the low-£ likelihood between £ m \ n = 2 and £ m2LX = 49, 
where C^ d is the best-fit Planck ACDM spectrum, and £q = 
(Ciin + dax)/2. The resulting distributions are shown in Fig. 32 
for three masks, covering 77.4 (A; WMAP KQ85), 87.5 (B) and 
97.2% (C) of the sky, respectively. The internal agreement is ex- 
cellent, with parameters differing by less than 0.3 cr between the 
very aggressive mask A and the conservative mask C. While any 
of these masks would establish an acceptable likelihood, we ad- 
opt Mask B as our fiducial mask for two reasons. On the one 
hand, the parameter uncertainties obtained with Mask B are only 
4% larger than those obtained for the minimal Mask A, indicat- 
ing that both nearly saturate the cosmic variance limit. On the 
other hand, analysis of realistic simulations indicate the pres- 
ence of statistical significant map residuals near the Galactic 
plane that are accepted by Mask A, but rejected by Mask B 
(Planck Collaboration XII 2013). The latter therefore represents 
a good compromise between rejecting foreground residuals and 
maximizing statistical power. 

We include 100000 Gibbs samples in the likelihood es- 
timator, ensuring excellent convergence characteristics for the 
Blackwell-Rao estimator for £ < 50. 
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Figure 33. Comparison of constraints on r and A s using the split 
temperature-polarization WMAP likelihood approach (dashed 
contours; Eqs. 24 and 25) with those obtained with the exact 
brute-force pixel likelihood (shaded contours; Eq. 21). 



8.2. Low-l polarisation likelihood 

The present Planck data release includes only temperature data. 
In this release, we therefore supplement the Planck likelihood 
with the 9-year WMAP polarisation likelihood 10 derived from 
the WMAP polarisation maps at 33, 41, and 61 GHz (Ka, Q, and 
V bands) (Page et al. 2007; Bennett etal. 2012). However, we 
introduce one modification to this pixel-based likelihood code, 
replacing the spherical harmonics coefficients of the temperature 
field, a T tm , in Eq. 24 and 25 with those derived from the Planck 
temperature map derived by Commander, for which the Galactic 
plane has been replaced with a Gaussian constrained realization. 

In Fig. 33, we compare constraints on r and A s as derived 
with this split likelihood with those obtained through an exact 
brute-force evaluation of Eq. 21, simultaneously including tem- 
perature and polarisation measurements at A^de = 16. The two 
methods produce almost indistinguishible results. 

In Appendix E we assess the robustness of the WMAP polar- 
isation likelihood with respect to dust contamination, by repla- 
cing the WMAP polarised dust template with the far more sensit- 
ive HFI 353 GHz polarisation map. We find that the optical depth 
to reionization, r, is reduced by about 0.5 - 1 <x, depending on the 
template removal method adopted. However, since the Planck 
polarisation maps are excluded from the current data release, we 
adopt the WMAP polarisation likelihood without further changes 
for now, and will return to this topic in the next data release. 

8.3. Low -I power spectrum - consistency and robustness 

In this section, we present the low-£ Planck CMB temperat- 
ure power spectrum derived using the Commander approach de- 
scribed above, and assess its robustness through comparisons 
with three alternative foreground-cleaned Planck CMB maps 
(NILC, SEVEM, and SMICA; Planck Collaboration XII 2013), 
as well as with the 9-year WMAP ILC temperature map 
(Bennett et al. 2012). 

As a first consistency test, we compute the power 
spectrum from each map using Bolpol (Gruppuso et al. 
2009; Molinari et al. 2013), an implementation of the quad- 
ratic maximum-likelihood power spectrum estimator (Tegmark 
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1997). Each map is smoothed to an effective resolution of 
329.8 r FWHM, to suppress aliasing from high multipoles 
(Keskitalo et al. 2010), and repixelized on an N S ^ Q = 32 
HEALPix grid. Gaussian white noise with a variance of 4 yuK 2 
is added to each map to regularize the noise covariance matrix. 

Here we adopt the U78 common mask, defined in 
Planck Collaboration XII (2013), to exclude regions of high 
Galactic emission, leaving 78% of the sky for analysis. We re- 
move the observed monopole and dipole in the mask. The res- 
ulting power spectra up to I < 64 are shown in the top panel of 
Fig. 34, while the bottom panel shows the power spectrum re- 
siduals of each map relative to the Commander map. Note that 
the same noise realization was added to each map, and the reg- 
ularization noise therefore contributes little in this plot. For the 
different internally-derived Planck maps, no residual spectrum 
exceeds < 100 yuK 2 and is typically < 50 yuK 2 at I > 10. The 
WMAP spectrum exhibits significantly larger residuals, and are 
typically of the order of -100 fiK 2 at £ > 30. 

Figure 35 shows the Planck and WMAP temperature power 
spectra derived directly from the respective likelihood code, 
while Fig. 36 shows the corresponding constraints on the 
two-parameter amplitude-tilt model employed in Sect. 8.1.2, 
including multipoles between 1-2 and 30. Neglecting the 
minor differences in the masks adopted by the two codes, 
these power spectra and parameter constraints are largely 
dominated by cosmic variance, and one should therefore expect 
the two distributions to be almost identical. Instead, from 
Fig. 36 we see that the WMAP low-£ spectrum is 2.5-3% 
higher than the Planck spectrum. For a detailed discussion 
of this discrepancy, including a comparison at higher I, see 
Planck Collaboration XI (2013). Here we only note that the 
effect is robust with respect to foreground removal and power 
spectrum evaluation algorithms, and also point out that the effect 
at low multipoles is too large to be explained by uncertainties 
in the Planck transfer functions (Planck Collaboration II 
2013; Planck Collaboration VII 2013) or calibration 
(Planck Collaboration V 20 1 3 ; Planck Collaboration VIII 
2013). Also note that the amplitude of the low-/' spectrum 
relative to the Planck best-fit model, (q, n) = (1,0), derived 
including the full multipole range between 2 < I < 2500, is 
somewhat low in Fig. 36, with a best-fit amplitude of q ~ 0.9. 
This observation is discussed and quantified in greater detail in 
Sect. 9.3. 



9. The Planck CMB spectrum and likelihood 

9.1. Hybridisation of low- and high-l likelihoods 

The high-/' and low-/' likelihoods introduced in Sects. 2 and 8 
each describe only a part of the full Planck data set. To estimate 
cosmological parameters from all the angular scales probed by 
Planck, they must be combined into a single likelihood function 
that describes all multipoles from I - 2 to 2500. 

In principle, it is desirable to include as many multipoles as 
possible in the low-£ likelihood, since it captures the full non- 
Gaussian structure of the likelihood. The Gaussian approxim- 
ation for the likelihood using pseudo- spectra also improves at 
higher multipole due to the increasing number of degrees of free- 
dom (Efstathiou 2004). For Planck we adopt a transition mul- 
tipole of ^trans = 50, a compromise between obtaining robust 
convergence properties for the low-/' likelihood, and ensuring 
that the Gaussian approximation holds for the high-£ likelihood 
(Hamimeche & Lewis 2009). 
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Figure 34. Top panel: temperature power spectra evaluated from downgraded Planck maps, estimated with Commander, NILC, 
SEVEM, or SMICA, and the 9-year WMAP ILC map, using the Bolpol quadratic estimator. The grey shaded area indicates the 1 cr 
Fisher errors while the solid line shows the Planck ACDM best fit model. Bottom panel: Power spectrum differences for each 
algorithm/data set relative to the Commander spectrum, estimated from the spectra shown in the panel above. The black lines show 
the expected 1 cr uncertainty due to (regularization) noise. 
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Figure 35. Top: Comparison between the low-/ 7 Planck temper- 
ature power spectrum estimated by Commander and the 9-year 
WMAP spectrum (Bennett et al. 2012). Error bars indicate 68% 
confidence regions. Bottom: Difference between the WMAP and 
Planck low-/ 7 spectra. 
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Figure 36. Probability distribution for a two-parameter (amp- 
litude and tilt) model derived from the Planck (blue) and WMAP 
(red) low-/ 7 likelihoods, including angular scales between i = 2 
and 30. 



To combine the likelihoods, we must account for the weak 
correlations between the low- and high-/' components. We con- 
sider three options: 

1. Sharp transition: The low-/ 7 likelihood ends at £ max = 49; 
the high-/ 7 likelihood starts at £ m \ n = 50; no correlations are 
accounted for. 

2. Gap: The low-/ 7 likelihood ends at ^ max = 32; the high-/ 7 like- 
lihood starts at £ m { n = 50; no correlations are accounted for, 
but the gap is sufficiently wide that any correlations are neg- 
ligible. 



3. Overlap with correction: The low-/ 7 likelihood ends at £ max = 
70; the high-/ 7 likelihood starts at ^ max = 50; the double- 
counting of the overlap region is accounted for by sub- 
tracting from the log-likelihood a contribution only includ- 
ing 50 < I < 70 as evaluated by the Commander estim- 
ator. Under the assumption that no correlations extend from 
I < 50 to £ > 70, this approach is exact (for further details, 
see Gjerl0w et al. 2013). 

We estimate cosmological parameters using all three meth- 
ods, and find that the posterior means typically vary by < 0.1 cr. 
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The largest variation is seen including the running of the spec- 
tral index of scalar perturbations, in which the posterior mean 
changes by 0.2 cr. Further, all deviations at the 0. l-0.2cr level are 
seen for case 2 above, which excludes data compared to the other 
two; case 1 and 3 give nearly indistinguishable results. Since 
case 1 is implementationally simpler, and can be estimated more 
efficiently (see Sect. 8), we select this method, adopting a sharp 
transition at £ max = 50. 

9.2. The Planck power spectrum and ACDM constraints 

Using the full Planck likelihood, we now present the final 2013 
Planck CMB power spectrum. For this, we fix all nuisance para- 
meters to their maximum-likelihood values. The resulting spec- 
trum is shown in Fig. 37 together with the corresponding best-fit 
six-parameter ACDM model. The agreement between the obser- 
vations and the model is excellent over most of the multipole 
range. Only at low is it possible to see a systematic offset in 
the form of a slight power deficit; this will be addressed separ- 
ately in the next section. 

Table 8 provides a summary of the ACDM para- 
meters derived using the methodology described in 
Planck Collaboration XVI (2013) from the Planck likeli- 
hood. Here we use the same prior ranges on all parameters as 
in Planck Collaboration XVI (2013). These are as in Table 5, 
except for A™, A tSZ , and A kSZ , which are modified to [0,20], 
[0, 10], and [0, 10] respectively. Results are given for Planck 
alone, and in combination with the low-£ WMAP polarisation 
likelihood (Planck+WP). For each case, we report both pos- 
terior maximum and mean values. Uncertainties denote 68% 
confidence limits. 

A detailed discussion of these results is presented in 
Planck Collaboration XVI (2013), including an analysis of ex- 
tended cosmological models, and their compatibility with other 
astrophysical data sets. The bounds derived from Planck alone 
are significantly tighter than those from the 9-year WMAP data 
alone, and comparable or better than those inferred from WMAP 
combined with SPT and ACT observations. These new con- 
straints provide a precision test of the ACDM model. In general, 
we find good agreement with results derived from other astro- 
physical data sets, although there are a few exceptions that are in 
moderate tension with Planck (Planck Collaboration XVI 2013). 

Considering each of the six ACDM parameters in turn, we 
first note that Planck constrains the physical baryon density 
to Q^h 2 - 0.02207 ± 0.00033, which is in remarkable agree- 
ment with standard BBN predictions based on a determination of 
the primordial abundance of deuterium, Q^h 2 = 0.021 ± 0.001 
(Ioccoetal. 2009), but with a fractional uncertainty of 1.5,%, 
three times smaller than the BBN uncertainty. The physical dens- 
ity of dark matter is measured with a fractional uncertainty of 
2.6%, providing new constraints on specific dark matter pro- 
duction scenarios. The single most precise parameter, however, 
is the angular size of the sound horizon at the last- scattering 
surface, which is measured with a fractional uncertainty of 
0.065 % by Planck, improving on the combined WMAP, ACT, 
SPT, and SNLS33 constraint by a factor of two. 

Next, given that no polarisation data are included in the cur- 
rent data release, it is remarkable that Planck alone constrains 
the optical depth to reionization, r, with a fractional error of 
40%. This is made possible by Planck's high angular resolution 
and sensitivity, which allows a high signal-to-noise measure- 
ment of lensing in the small-scale CMB power spectrum. This 
in turn breaks the well-known e~ 2T A s degeneracy between r and 



the amplitude of scalar perturbations, A s . The fractional uncer- 
tainty on A s from Planck alone is 7 %. 

Having sufficient power to measure r from small angu- 
lar scale temperature data, Planck naturally also provides very 
strong constraints on the spectral index of scalar perturbations, 
n s , leading to a fractional uncertainty of 0.97%. The scale- 
invariant Harrison-Zeldovich spectrum, n s = 1, is ruled out 
at a significance of 4.1cr from the Planck temperature spec- 
trum alone. The analyses presented in Planck Collaboration XVI 
(2013) and Planck Collaboration XXII (2013) show that the 
preference for a (red) tilted primordial spectrum remains very 
strong also within most extensions beyond the minimal ACDM 
model. The implications of this results for inflationary models 
are discussed in Planck Collaboration XXII (2013). 

With our choice of cosmological parameters, the Hubble 
parameter, Hq, and the fractional density of the cosmological 
constant, £l A = I- Q m , are derived parameters. They are probed 
by CMB observations mainly through their impact on #mc, and, 
to lesser extent, by the impact of Da on the late-time integrated 
Sachs- Wolfe effect. Since Omc is accurately measured, a par- 
ticular combination of Ho and is very well constrained by 
Planck, although in a model-dependent way; 6* depend on other 
cosmological parameters, such as the spatial curvature radius, 
neutrino masses, the number of relativistic degrees of freedom, 
or a possible dark energy equation of state parameter. 

The results reported in Table 8 rely on the assumption of 
a flat ACDM cosmology with three neutrino species, two of 
which are assumed to massless and one featuring a small mass 
m v = 0.06 eV, reflecting the lower bound on neutrino masses im- 
posed by neutrino oscillation experiments. Under these assump- 
tions, Planck finds preferred ranges for Ho and Qa that are lower 
than previous CMB experiments. For instance, Planck+WP 
gives Hq = 67.3 ± 1.2 kms _1 Mpc _1 , to be compared with 
70.5 ± 1.6 km s _1 Mpc _1 for the combined WMAP9+eCMB data 
set presented by Hinshaw et al. (2012). The underlying cosmo- 
logy in the two analyses is the same, excepted for the small neut- 
rino mass introduced in our default ACDM model. However, if 
we assume all three neutrino species to be massless, our best-fit 
and mean values for Hq increase only by 0.6 km s _1 Mpc _1 . Thus, 
the tension is clearly driven by the data rather than by theoretical 
assumptions. Planck Collaboration XVI (2013) shows that our 
results for Hq and Qa are in very good agreement with Baryon 
Acoustic Oscillation data, but in moderate tension with other 
cosmological probes. For instance, our Planck+WP bounds on 
Hq disagree at the 2.5cr level with direct determinations of the 
Hubble parameter using cepheids and supernovae (Riess et al. 
1998) or quasar time delays (Suyu et al. 2012), as well as with 
the results of the Carnegie Hubble Program (Freedman et al. 
2012). Our bounds on Qa are in a slight 2cr tension with 
the results of the SNLS supernovae collaboration (Conley et al. 
2011; Sullivan et al. 2011), although in better agreement with 
the Union2.1 compilation (Suzuki et al. 2012). Our combined 
determination of erg and Q m shows larger tension with recent 
data based on cosmic shear or cluster count techniques. On the 
other hand the Planck best-fit ACDM model is in good agree- 
ment with the halo power spectrum derived from the luminous 
red galaxy catalogue of the Sloan Digital Sky Survey (Reid et al. 
2010), especially when the analysis is restricted to linear scales. 

9.3. Significance of the low-l tension with ACDM models 

From the above discussion, it is clear that the ACDM frame- 
work provides an excellent model for most of the Planck data. 
However, as noted in Sect. 8.3, and seen in Fig. 37, the low-^ 
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Figure 37. The 2013 Planck CMB temperature angular power spectrum. The error bars include cosmic variance, whose magnitude 
is indicated by the green shaded area around the best fit model. The low-£ values are plotted at 2, 3, 4, 5, 6, 7, 8, 9.5, 1 1.5, 13.5, 16, 
19, 22.5, 27, 34.5, and 44.5. 



Table 8. Constraints on the basic six-parameter ACDM model using Planck data. The top section contains constraints on the six 
primary parameters included directly in the estimation process, and the bottom section contains constraints on derived parameters. 



Planck 



Planck+WP 



Parameter Best fit 

Q h h 2 0.022068 

Q c h 2 0.12029 

1OO0mc 1.04122 

r 0.0925 

n s 0.9624 

ln(10 10 A s ) 3.098 

Q A 0.6825 

Q m 0.3175 

cr 8 0.8344 

Zre H.35 

H 67.11 

10 9 A S 2.215 

Q m h 2 0.14300 

Age/Gyr 13.819 

z* 1090.43 

1000, 1.04139 

Zeq 3402 



68% limits 



Best fit 



68% limits 



0.02207 ± 0.00033 
0.1196 + 0.0031 
1.04132 ±0.00068 
0.097 ± 0.038 
0.9616 ± 0.0094 
3.103 + 0.072 



0.022032 

0.12038 

1.04119 

0.0925 

0.9619 

3.0980 



0.02205 + 0.00028 
0.1199 + 0.0027 
1.04131 ±0.00063 

0.089t8S 
0.9603 ± 0.0073 
3 089 +0 024 



0.686 ±0.020 0.6817 

0.314 ±0.020 0.3183 

0.834 ± 0.027 0.8347 

11.4+4-0 n 37 

67.4 ± 1.4 67.04 

2.23 ±0.16 2.215 

0.1423 ±0.0029 0.14305 

13.813 ±0.058 13.8242 

1090.37 ± 0.65 1090.48 

1.04148 ±0.00066 1.04136 

3386 ± 69 3403 



0.685^-018 
0-315^-016 

0.829 ± 0.012 
11.1 ± 1.1 
67.3 ±1.2 

Z ' i ^ D -0.060 

0.1426 ±0.0025 
13.817 ±0.048 
1090.43 ± 0.54 
1.04147 ±0.00062 
3391 ± 60 
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Table 9. Results of the Hausman test applied to the temperature 
power spectrum for 2 < £ < 32. 



Data set 


e obs 


P( Sl < s° bs ) 






[%] 


Commander 


-0.647 


0.73 


NILC 


-0.649 


0.73 


SEVEM 


-0.804 


0.50 


SMICA 


-0.589 


1.33 


WMAP9 ILC . . 


. . -0.234 


7.18 



Planck temperature power spectrum appears to be in some ten- 
sion with the best-fit Planck ACDM model, which for Planck is 
almost exclusively determined by the small-scale spectrum. In 
this section we assess the significance and impact of this tension 
between low and high £s using three different statistical tests. 

We start by applying a modified Hausman test (Polenta et al. 
2005; Planck Collaboration II 2013) to the low-£ spec- 
tra derived from the four foreground-cleaned Planck maps 
(Planck Collaboration XII 2013) and the 9-year WMAP ILC 
map, using multipoles up to £ max = 32. This test uses the statistic 
si = sup r £(4iax, r), where 

^ int(^ max r) 

B(£ max ,r)=-j= Yj H € ,re [0,1] (29) 

(30) 




VVarQ 



and Ce and Q denote the observed and model power spec- 
tra, respectively. Intuitively, this statistic measures the relat- 
ive bias between the observed spectrum and model, meas- 
ured in units of standard deviations, while taking into account 
the so-called "look-elsewhere effect" by maximizing s\ over 
multipole ranges. We use realistic Planck 'FFP6' simulations 
(Planck Collaboration I 2013) to derive the empirical distribu- 
tion of s\ under the null hypothesis. Figure 38 compares the res- 
ults obtained from the data with the simulation distribution, and 
Table 9 lists significances. As measured by this statistic, we see 
that a negative bias is found in the low-£ Planck power spectrum 
relative to the ACDM model at the 99% confidence level. 

For the WMAP ILC map the significance of the negative bias 
nominally decreases to 93%. This is consistent with the find- 
ings in Sect. 8.3, where it was shown that the WMAP temperat- 
ure power spectrum is 2.5-3 % higher than the Planck spectrum 
at low £'s. However, as discussed in Planck Collaboration XI 
(2013), a similar amplitude difference between the two exper- 
iments is also seen at smaller scales. Since the current test com- 
pares the observed WMAP data with the best-fit Planck ACDM 
model, the present test is not optimal for assessing internal con- 
sistency between low and high £s within the WMAP data. 

Next, to obtain a quantitative measure of the relative power 
discrepancy between low and high £s, we fit the two-parameter 
amplitude-tilt power spectrum model (see Sect. 8.1.2) to the 
Planck data using the low-£ likelihood restricted to various mul- 
tipole ranges defined by 2 < £ < £ max , where £ max is allowed to 
vary. Thus, this measures the amplitude of the low-£ spectrum 
relative to the best-fit Planck ACDM spectrum, which is driven 
by the smaller angular scales. Figure 39 shows the resulting con- 
straints on the power spectrum amplitude, q, as a function of 
Ciax, after marginalizing over the tilt, n. For comparison, we also 
show similar constraints derived using the low-£ WMAP temper- 
ature likelihood up to £ = 30. The best-fit amplitude is q ~ 0.9 



Commander 
Severn 
Smica 
Nile 

WMAP9 



si 



Figure 38. Results of the Hausman test applied to the temperat- 
ure power spectrum for 2 < £ < 32. The black histogram shows 
the expected distribution, estimated with simulations, of the s\ 
test statistic. The vertical bars represent Planck CMB maps and 
the 9-year WMAP ILC map. Note that the statistic is indistin- 
guishable for the NILC and Commander maps. 
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Figure 39. Power spectrum amplitude, q, relative to the best-fit 
Planck model as a function of ^max? 

as measured by the low-£ 
Planck and WMAP temperature likelihoods, respectively. Error 
bars indicate 68 and 95% confidence regions. 



for £ max = 20-35, different from unity at a statistical signific- 
ance of 2-2. 5 cr by this measure. The WMAP spectrum shows 
a consistent behaviour, up to the same overall scaling factor of 
2.5-3% between Planck and WMAP discussed above. We have 
verified that these results are insensitive to the (well-known) low 
quadrupole moment by excluding from the analysis; the large 
cosmic variance of this particular mode results in a low overall 
statistical weight in the fit. 

Finally, we assess the impact of the low-£ power deficit on 
the ACDM model estimated using the Planck likelihood 1 1 (aug- 
mented with the WMAP polarisation likelihood). We fit a low-/' 
rescaling amplitude, A\ ow for £ < £\ ow jointly with the ACDM 
parameters, i.e., Q = A low C^ CDM for £ < 4>w and Q = Cf CDM 



11 We have verified that the following results are insensitive to 
whether Plik or CamSpec are used for the high-^ likelihood. 



34 



Planck collaboration: CMB power spectra & likelihood 



for £ > £\ ow . Figure 40 shows the resulting posterior distribu- 
tions for Ai ow for £\ ow = 32 (green) and £\ ow = 49 (blue). The 
purple line shows the same when replacing the Planck low-£ 
likelihood with the WMAP \ow-£ likelihood (4> w = 32). The 
corresponding best-fit values are A\ ow = 0.899 ± 0.046 (Planet, 
4>w = 32), A low = 0.953 ± 0.033 (Planck; 4>w = 49) and A low = 
0.953 ± 0.048 (WMAP; £ low = 32), respectively. As already 
noted in Sect. 8.3, these values are too large to be explained 
by the < 1% uncertainties in the Planck transfer functions 
(Planck Collaboration II 2013; Planck Collaboration VI 2013). 




0.7 0.8 0.9 1.0 1.1 

Figure 40. Posterior distributions for the low-£ spectrum amp- 
litude, Ai ow , estimated using the Planck likelihood, with £\ ow = 
32 (green) and £\ ow - 49 (blue). The purple line show the dis- 
tribution derived using the WMAP temperature likelihood with 

Aow = 32. 

In Fig. 41 we show the posterior distributions for £2 c /z 2 , n s 
and #0 after marginalizing over A iow for £\ ow = 49. (Adopting 
£\ ow = 32 results in negligible differences for all parameters ex- 
cept Ai ow ). Shifts of 0.6-l<x are observed compared to the refer- 
ence model, Aiow = 1 • We note that Hq, which already has a 'low' 
value (for a detailed discussion, see Planck Collaboration XVI 
2013), prefers an even lower value when allowing a rescaling of 
the low-£ spectrum. As a final test, we replace the entire low-^ 
likelihood, both temperature and polarisation, with a Gaussian 
prior on the optical depth of reionization, r = 0.089 ± 0.014, 
matching the WMAP measurement (Hinshaw et al. 2012). The 
resulting posteriors are shown as purple lines in Fig. 41, and 
agree well with the case including a low-£ scaling factor, but are, 
in fact, slightly further away from the reference model. Although 
not very significant in an absolute sense, these results do indic- 
ate that the high-/ 7 likelihood is challenged in finding models that 
also fit the low-£ power spectrum. 

To summarize, we have phenomenologically quantified a 
tension between the low-£ CMB power spectrum at £ < 40 
and the Planck best-fit ACDM model. Its significance varies 
between 2.5 and 3 <x depending on the estimator used. The ef- 
fect is seen in all four Planck foreground-cleaned CMB maps 




0.113 0.122 0.130 0.930 0.955 0.981 




62.9 66.8 70.7 0.8 0.9 1.0 1.1 



Figure 41. Comparison of the posterior distributions for £l c h 2 , 
ft s , and Hq for the default six-parameter ACDM model con- 
strained by Planck (red); compared to the case when we allow 
a variable low-£ power spectrum amplitude at £ < 49 (blue); 
and when replacing the low-£ temperature likelihood with a 
Gaussian prior on r, the optical depth of reionization (purple). 
The lower right panel shows the posterior distribution for the 
low-£ amplitude, Ai ow . 

with little variation. It is also present in the 9-year WMAP data, 
although an overall amplitude difference of 2.5-3 % between the 
data sets complicate a direct comparison. To make further pro- 
gress, one would seek to establish a physical model that pre- 
dicts a low-^ power deficit compared to high Ts, and that may 
also predict other observable effects which may be tested with 
cosmological data. Such a model may be related to the tentat- 
ive detections of violations of statistical isotropy discussed in 
Planck Collaboration XXIII (2013), e.g., the low CMB temper- 
ature variance, the hemispherical power asymmetry, or the align- 
ment between the quadrupole and octopole moments. 

10. Discussion & conclusions 

We have presented the Planck likelihood, which provides a de- 
tailed and accurate characterisation of the two-point statistics of 
the CMB temperature field, accounting for all significant sources 
of uncertainty; statistical, instrumental, and astrophysical. This 
likelihood function allows us to present an estimate of the CMB 
temperature power spectrum that spans more than three decades 
in £ with unprecedented precision; a spectrum that saturates the 
cosmic variance limit at all scales > 0.1°, nearly exhausting the 
information content of the temperature anisotropies, and, in fact, 
is becoming limited by uncertainties due to astrophysical fore- 
ground modelling. This is precisely what was originally prom- 
ised at the time when Planck was selected by ESA in March 
1996. 

On large angular scales, £ < 50, the Planck likelihood is 
based on a Gibbs sampling approach that allows joint CMB 
power spectrum and component separation analysis, while ac- 
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curately marginalizing over a physically motivated foreground 
model constrained by the 30-353 GHz Planck frequencies. On 
intermediate and small scales, the Planck likelihood employs a 
fine-grained set of cross-spectrum combinations among the 100, 
143, and 217 GHz detector maps to constrain the high-/' CMB 
power spectrum, ensuring that no noise bias can compromise the 
results, while at the same time allowing for physical foreground 
modelling in terms of power spectrum templates. This emphasis 
on physical foreground modelling has made it possible to com- 
bine the full power of the Planck data with observations from 
higher-/' CMB experiments. 

We have validated our results through an extensive suite of 
consistency and robustness analyses, propagating both instru- 
mental and astrophysical uncertainties to final parameter estim- 
ates. Further, we have studied in detail the well-known degen- 
eracies that exist between the foreground and cosmological para- 
meters at high £s when only including Planck observations, and 
shown that they have only a weak impact on cosmological con- 
clusions. 

On a more detailed level, we draw the following conclusions: 

- The consistency between power spectra measured at differ- 
ent frequencies is remarkable. In the signal-dominated re- 
gime for single detectors, at £ < 1000, the cross-spectra 
show an RMS dispersion of a few yuK 2 in multipole bands 
of 81 - 31. This confirms the relative calibration of the 100, 
143, and 217 GHz detectors to -0.2%. 

- The differences, AD { , between the 143 x 143, 143 x 217, and 
217 x 217 cross-spectra averaged over multipole bands of 
61 * 31 have a dispersion over 800 < £ < 1500 of 9, 5, and 
5//K 2 , respectively, after subtracting the best-fit foreground 
model. This dispersion is not primarily of instrumental ori- 
gin, but can be predicted from a model of the chance correl- 
ations between foregrounds and CMB fluctuations. 

- At high £s, the power spectrum of the four foreground- 
cleaned CMB maps derived through component separation 
are consistent within their uncertainties. The cosmological 
parameters derived from these maps are consistent with 
those estimated by the Planck likelihood for £ < 2000, des- 
pite very different foreground models. 

- At low £s, the power spectrum differences among the four 
foreground-cleaned CMB maps are below 50 yuK 2 for nearly 
every single multipole. Residuals with respect to the 9-year 
WMAP ILC map are slightly larger, typically 100 K 2 or 
more. A detailed comparison between Planck and WMAP 
reveals a systematic power spectrum amplitude difference at 
the 2% - 3% level that cannot be accounted for within the 
Planck instrumental error budget. This is consistent with the 
findings presented in Planck Collaboration XI (2013). 

- Parameters derived from the 70 GHz Planck frequency map 
are in excellent agreement with the reference results derived 
using the Planck likelihood; when the latter is limited to 
£ < 1000, the agreement is even more striking. This con- 
firms the strong internal consistency between the LFI and 
HFI instruments. 

- The best-fit ACDM model derived from the Planck likeli- 
hood predicts TE and EE spectra in exquisite agreement 
with the measured polarization signature over a broad range 
of frequencies (70 to 217 GHz) and multipoles (£ < 1000). 
At 100, 143, and 217 GHz, the instrumental noise in the EE 
spectrum is at the yuK 2 level for £ < 1000, and the visible dif- 
ferences between the spectra are dominated by their different 
levels of foreground contribution, not by systematic effects. 



- We report a tension between the Planck best-fit ACDM 
model and the low-£ spectrum in the form of a power de- 
ficit of 5-10% at £ < 40, with a statistical significance of 
2.5-3 cr. Thus, while the minimal ACDM model provides 
an outstanding fit for intermediate and small angular scales, 
this tension may suggest that the model is incomplete. In this 
respect, it is worth noting that other, but possibly related, an- 
omalies have been reported in a companion paper studying 
statistical isotropy in the Planck sky maps at statistically sig- 
nificant levels. 

In summary, we find that the majority of the Planck data can 
be described by a minimal six-parameter ACDM model with a 
very high degree of accuracy. Within this model the statistical 
uncertainties are dominated by astrophysical foreground model- 
ling by scales of £ ^ 1500. At lower £s, the unprecedented qual- 
ity of the Planck data is such that the only fundamental limit is 
that we can only observe one CMB sky. In other words, Planck 
is cosmic variance dominated at £ < 1500, extragalactic fore- 
ground dominated at £ > 1500, and dominated nowhere by in- 
strumental noise or systematic errors. 

Using only Planck data, we report a detection of n s < 
1 at more than 4<x confidence, significantly stronger than 
the limit derived from WMAP, SPT, ACT, and SNLS3 
combined. Complementing the Planck observations with the 
9-year WMAP polarization data increases the significance 
to 5 Act. The multipole range above £ > 1500 is cru- 
cial for constraining possible extensions to the minimal 
ACDM model; for a detailed exploration of a wide range 
of such models, see Planck Collaboration XVI (2013) and 
Planck Collaboration XXII (2013). There we report some ten- 
sions among the CMB damping tail parameters, including £l K , 
n mn , and Fp. However, none of these indicate significant depar- 
tures from the ACDM framework. 

In the near future, we will extend our analysis to produce a 
cosmic variance limited likelihood and power spectrum reach- 
ing to higher multipoles. To some extent, this will be achieved 
through more sophisticated astrophysical foreground modelling, 
and by exploiting additional frequency information. However, 
the two major steps forward will be, first, to include the Planck 
polarization observations in the likelihood analysis, and, second, 
to exploit the full data set generated by the two Planck instru- 
ments. The amount of HFI data available for analysis is nearly 
double that which is presented here, and the LFI instrument is 
still observing at the time of writing. 
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Appendix A: High-l likelihood details 

A.1. Power spectra and the coupling matrix 

We denote the pixel weight function for temperature by wj . The 
pseudo-spectra of Eq. 1 are constructed using the following: 



(A.1) 



where the sum is over the pixels in the map. 

The coupling matrix appearing in Eq. 2 is given by 
Hivon et al. (2002): 



2,(2^3 + me 3 0' 

h v ' 

= (2{ 2 + l)E TT (d Ji,W) 



M TT 2 = (2^+1) 



(A.2) 
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where for the cross spectrum (i, j), We is the power spectrum of 
the window function 



We form the linear combination of individual cross-spectra, 
for each multipole, in the following way: 



w ij = - 

1 (2^+1) 



(A.3) 



C 



z 

ijck, i±j 



(b T / j )Hp T ,) 2 



(A. 10) 



A.2. Pseudo-Ce covariance matrices 

For the case of narrow window functions and uncorrected pixel 
noise (o~J) 2 , the covariance matrices can be approximated as 



Here the index k denotes the particular frequency cross- spectrum 
combination (e.g., 100 x 100, 143 x 217), the coefficients y t de- 
note the multiplicative factors for each map, b^ lj is the (isotrop- 
ised) beam transfer function for the map combination ij, and pi 
is the isotropised pixel window function 12 . The coefficients 
are normalized so that 



ijck, i±j 



1, 



How can we determine the coefficients o^-? A near optimal 



(Acf j AC T / q ) « C]CI [EttW, ?, W (ip)Uq) ) + E TT (£, t\ W {iq)Up) )\ 
+ (CjcJ) 1/2 x 

[E TT (tJ',W 2nip)Uq) ) 

+E TT {£,£',W 2T{iq){jp) ) (A.4) combination, Xh, is given by solving 

+E TT ((,(',W 2TUpKiq) ) 
+E TT (£J',W 2T(iq>iip> ) 
+ E TT ({, £', W TT(ip)Uq) + E TT ({, (', W TT<iq)Up) ) 



0. 



(A. 11) 



pq 



pq 



(A.12) 



where S is the matrix defined in Equation A.2. The window 
functions are given by: 



w 



(ij)(pq) 
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+ 1) ^ 



(2€ + 1) 



W (m W (m . 



fyTTQJXpq) _ \ y ~T{ij)-T( Pq )* 

W t ~ (2^+l)Zj W (m W (m ' 



2T(if)(pq) 



w: 



where 
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+ 1) ^ 



(2€ + 1) 



~ (ij) ~ T(pq)* 
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6m 



r T(ij) 



s 

Y,(<r T s fw i s w j s a 2 s r tm {e s ). 



(A.5) 
(A.6) 
(A.7) 

(A.8) 
(A.9) 



To avoid cumbersome notation, we have omitted indices from 
the theoretical spectra appearing in Eq. A.4. In practice, these 
spectra include unresolved foreground contributions and are 
smoothed by the appropriate beam transfer functions b l K In ad- 
dition, these covariance matrices are corrected for the pixel win- 
dow functions pi (i.e., covariance matrices such as (AC/ J AC € , pq ) 
are divided by p 2 p 2 ,) 



A.3. Combining intra-frequency cross-spectra 

For Planck, the vector containing the power spectra, and its as- 
sociated covariance matrix, are both large and so require sub- 
stantial compression to make the computation of a high-/' like- 
lihood fast enough for parameter estimation. As described in 
Appendix A.7, after we correct for the 'effective' calibration 
factors for each individual detector set, the power spectra at 
each freqency are consistent to extremely high accuracy. Any 
remaining residuals have a negligible impact on the cosmolo- 
gical analysis. Thus, we combine the cross- spectra from differ- 
ent detectors within a given frequency combination into a single 
power spectrum. We do not average across frequency combin- 
ations since the unresolved foregrounds depend on frequency. 
Further compression can be accomplished, if desired, only after 
unresolved foreground parameters have been determined. 



where M p l q is the block of the inverse covariance matrix appro- 
priate to the spectrum combination k. If the covariance matrix M 
accurately describes the data, the solution of Eq. A.12 properly 
accounts for the correlations between the cross-spectra. Solving 
Eq. A.12 requires the inversion of a large matrix, so we adopt 
a simpler solution by weighting each estimate by the diagonal 
component of the relevant covariance matrix, e.g., 

a] TiJ oc 1 /Cov(Cf 7 C] lJ ). (A. 1 3) 

This has the effect of assigning each cross-spectrum equal 
weight in the signal dominated regime and the correct inverse 
variance weighting in the noise dominated regime. This is the 
correct solution in the noise dominated regime. The analysis of 
intra-frequency residuals presented in Appendix A.7 shows that 
in the signal dominated regime we see excess variance (with no 
obvious dependence on the detector/detector set combination) 
compared to what we expect from instrument noise alone. This 
excess variance is small compared to the signal and is caused 
by residual beam errors, consistent with the beam eigenmode 
amplitudes discussed in Appendix A.7, that are not included in 
the covariance matrices. This is our justification for assigning 
roughly equal weight to the spectra in the signal dominated re- 
gime. 

When we construct a likelihood from the combined estim- 
ates we construct the full covariance matrix including cross- 
correlations between the various spectra. In this matrix, the 
cross-correlations in the signal dominated regime are domin- 
ated by cosmic variance if different masks are used for different 
frequencies. If identical masks are used for all frequencies, the 
cross-correlations in the signal-dominated regime are dominated 
by the cross correlations between the CMB and unresolved fore- 
grounds, which are included in the analytic covariance matrices 
and act as a regularizing contribution (see Appendix C). 



A.4. Covariance matrix of combined spectra 

The estimates of the Planck cross-spectra are linear combina- 
tions of the pseudo-Q estimates, so their covariance matrices 
are given by e.g., 

Cov(C T /C T /) = Yj ^Y iJ aY Pq Coy(C T £ iJ cl Pq ). (A.14) 



12 Note that for the masks used here, the isotropised pixel window 
function provided by HEALPIX is sufficiently accurate. 
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Figure A.l. Toy power spectra drawn from a Gaussian distribution with mean given by a ACDM power spectrum and a covariance 
given by the band-averaged Q covariance matrix, (top), and the corresponding differences with respect to the input ACDM model. 
Note the apparent presence of "coherent oscillatory features" in the difference spectra. This which are fully described by the power 
spectrum covariance matrix. To assess the statistical significance of apparently "unexpected features" in the power spectrum, it is 
critical to include all sources of systematic errors in the evaluation, as these can, and do, introduce significant correlations among 
different Q's. 



Analytic expressions for these covariance matrices have been 
given in Efstathiou (2004, 2006); Hamimeche & Lewis (2008), 
and are described in Appendix A. 2. The covariance matrices are 
computed assuming a fixed fiducial theoretical model including 
an unresolved foreground model for each frequency combina- 
tion. Typically, the unresolved foregrounds introduce corrections 
to the covariance matrices of a few percent in the transition re- 
gion between signal and noise domination. In addition, we com- 
pute the fiducial model by applying appropriate beam functions 
b Tij for each detector combination. 

As discussed above, the number of coupling matrices re- 
quired to compute expressions such as Eq. A. 14 scales as 
A^ ap and so becomes prohibitively expensive as the number 
of cross-spectra becomes large. However, most of these coup- 
ling matrices are similar, differing primarily in the amplitude 
of the noise levels and in minor respects such as a small num- 
ber of missing pixels. We can therefore adopt the same masks 
and weightings for groups of cross-spectra and compute coup- 
ling matrices only for distinct combinations. This dramatically 
reduces the computational burden. A similar approach was ad- 
opted by Lewis (2008) to analyse the WMAP 5-year temperature 
maps. 

It is also straightforward to calculate covariance matrices for 
differences between different averages. If we form two spectra 
averaging over different detector combinations 



then the covariance matrix of the difference Cj fl - C Tb is simply 



c 



(A.15) 
(A.16) 



Cov(C T / - C] b ) = ( aijaPq + P ij P M ~ aij P M -P ij( * Pq ) 

(A. 17) 



x(AC T / J AC T / q ). 



As a pedagogical illustration of the importance of these cor- 
relations, we show in Fig. A. 1 two toy power spectra drawn from 
a Gaussian distribution with mean given by a ACDM spectrum 
and covariance given by Cov(Cj fl - C T { b ). That is, these spec- 
tra are not computed from a real sky map, but simply drawn 
directly from the Q error model, and therefore exclusively con- 
tain correlations modelled by the covariance matrix. The appar- 
ent "coherent oscillatory features" seen in the difference spectra 
(bottom panels) are therefore fully described by the CamSpec co- 
variance matrix, accounting for correlated beam and foreground 
uncertainties, mask-induced coupling etc. The left panel shows a 
typical realisation, while the case in the right panel is selected as, 
visually speaking, one of the most "peculiar" within a relatively 
small set of simulatoins. When assessing the statistical signific- 
ance of "unexpected features" in the real CMB spectrum, e.g., 
similar to those seen in Figs. 15 and 16, it is critical to account 
for these correlations. 



A.5. The "fiducial Gaussian" approximation 

We use a likelihood based on the so-called "fiducial Gaussian" 
approximation. Here we present our justification for this choice, 
based on an "expansion in covariance" of the exact likelihood in 
the exact full-sky, isotropic noise case. 

Assuming the CMB, noise, and foregrounds are Gaussian, 
then the probability distribution for the a\ m coefficients of a col- 
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lection of maps, given a model, is p = e s , where 

S =2^+1/2) (trfQ-'Qj+lnlQl), (A. 18) 

up to a model-independent normalization.Here is the matrix 
of empirical spectra at a given multipole, and Q are their ex- 
pectation values for the model in question. 

Now, a key point to note is that theoretical power spectra 
typically differ from each other at each I by less than they dif- 
fer from the observed Q, because of cosmic variance and noise 
in the latter. So we are justified in expanding Eq. A. 18 about a 
reasonable fiducial model. Considering a single value of 'ell for 
simplicity, writing 

C = C f + A, (A.19) 

we obtain 

S = S f + (l+ l^tr^Cf^ACf^QQ^A 
+Cr 1 ACf- 1 ACf- 1 Ci 



--Cr 1 AC f - 1 A + ... 



(A.20) 



to second order in A. We may now complete the square in A after 
extracting a term that is small if the fiducial model is accurate. 
Up to terms independent of A, we have, to second order in A: 

S 2 = (t+ l/2)tr(cr 1 AC f - 1 A(Cr 1 C- 1)) 

+ l -{t + l/2)tr ((C - C)Cf l (C - QCf" 1 ) . (A.21) 

Here we have recombined the perturbation and the fiducial 
model back together, using Eq. A.19, in the second term to ob- 
tain exactly the "fiducial Gaussian" likelihood. The first term is 
a correction to the fiducial Gaussian likelihood that is typically 
small if the fiducial model is accurate. 

One can motivate neglecting this term by noticing that in 
its absence the approximate likelihood is unbiased (as the exact 
one is). One trades getting second derivatives exactly right in the 
vicinity of the fiducial model with getting the position, though 
not the depth, of the minimum right. 

Vectorizing the distinct elements of C - C (following 
Appendix A of Hamimeche & Lewis 2009), and recognizing the 
coefficients as the inverse covariance matrix elements of the 
spectra under the fiducial model, we obtain 



1 



>fid 



{c VL -c 11 , c zl -c zl , . . .)M~ L (c u -c 11 , c zl -c zl , . . .y 

(A.22) 

where M is the fiducial covariance matrix of the spectra, and the 
upper indices run on the different pairs of frequencies. 

This suggests an easy generalization to the coupled cut sky 
pseudo-spectra, given our calculation of their covariances in 
Appendix A. 2. We now replace the power spectra above with 
corresponding appropriate averages of detector cross-spectra. 
With bold face now denoting spectra laid out as vectors, and M 
the grand fiducial covariance, our final action is: 



Note that if we fix the foreground model C Fk for each spec- 
trum k, together with the calibration coefficients and beam para- 
meters, we can minimize the likelihood (Eq. A. 23) with respect 
to a 'best-fit' primary CMB spectrum. This 'best-fit' spectrum is 
given by the solution of 

^(At^)^C™ B = Yj(M^) kk \c k 'C* - C F /\ (A.24) 

kk'V 



1 



Sfid = " c ' c " c ' • • -)AT (C -c u , c zl -c zl ,...y 

(A.23) 

Another advantage of the "fiducial Gaussian" approximation is 
that instrumental uncertainties (calibration errors, beam errors, 
etc.) do not appear in the inverse covariance, but only in the ex- 
pression of the theoretical spectra C T in Eq. A.23 above. 



kk'V 



where the c k are spectrum effective calibration factors (see 
Appendix A.7). The covariance matrix of the estimates C^ MB 
is given by the inverse of the Fisher matrix: 



(A.25) 



<AQ CMB AQ CMB ) = 2(Al^)**' 

\kk' 



A.6. Uncertainties on individual detector sets beams and 
calibrations 

Let us consider two detectors (or detector sets) X and Y. 
Neglecting instrumental noise, the cross-spectrum C XY,ohs is re- 

XY skv 

lated to the true one, C, ' , through 



r XY,obs _ r XY,sky w XY,eff,true 



(A.26) 



where jy^ Y ' eff ' tme [ s me effective beam window function. Note 
that because of the optical beam non-circularity and the Planck 

scanning strategy, W XY ± (wf x Wj Y f /2 when X ± Y, while 

W XY = W YX for any X and Y. In the I range of interest, Wf Y > 

0, so we denote W XY = (# XY ) , following the usual prescription 
for simple (circular) beam models. In what follows, we will drop 
the XY pair superscript except when they are required for clarity. 

Our analyses use the best estimated Cf 1 of the sky power 
spectrum, where the measured Q bs is corrected by a nominal 
effective window W\ 



eff ,nom . 



-<obs / T T/eff,nom 



_ ^-.sky ^eff ,true ^eff,nom 
^sky / D eff,true / D eff,nom\ 2 



(A.27) 



The ratio £p tme /£p nom , which determines the uncertainty on 
the angular power spectrum due to the beam, is estimated using 
Monte-Carlo simulations of planet transits. 

We estimate Bf Qm and W™ Qm from the Monte Carlo simula- 
tions as 



"1VK 

fl~» = £(W;) 1/2 /»MC, 
z=l 

"MC 

= ^ Wynuc, 



jymean 



and compute the deviations around the mean 
A^ = ln(4/^ ean ). 



(A.28) 



(A.29) 



(A.30) 



Since the relative dispersion of the simulated W l € is small (less 
than 1%), the deviations are well approximated by 



A^ Un(wyW™ m ). 



(A.31) 
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The matrix A then has nuc rows and £ max + 1 columns. 
Singular Value Decomposition (SVD) is given by 



Its 



A = MDV 7 



(A.32) 



where M is an orthogonal n M c x ^mc matrix (i.e., M r M = 
MM r = I WMC ), D is a diagonal matrix with hmc non-negative 
eigenvalues, and V is a matrix with £ max + 1 rows whose ^mc 
columns are orthonormal vectors (i.e., V r V = I„ MC ). Here l nMC 
is the identity matrix. 

The covariance matrix of the beam deviations is defined as 



C ee A r A/(^ MC - 1) 
= \D 2 \ T /(n MC - 1), 
from which we compute the eigenmode matrix 
E ee W T /(n MC - D 1/2 



(A.33) 



(A.34) 



using the SVD of A. Most of the statistical content of A or C is 
limited to the first few modes ftmodes with the largest eigenvalues. 
We therefore keep only the largest ft mo des = 5 of the E matrix. 
The beam uncertainty for a given spectrum is then given by 



J$i — 2J mean 



exp(g^ 



( ^mode s ^ 



(A.35) 



where g is a vector of independant Gaussian variates of unit vari- 
ance with ft m odes elements, and Ekif) is the k-th row of E. 

This can be generalized to a set of spectra. Taking three pairs 
of detector sets a = {UV}, b - {XY} and c = {ZT}, one can write 



(\ a A b A c ) = (M a M b M c ) . 



(E a 

E c 



(A.36) 



and the covariance matrix is given by 

C abc = f^a A b ^ _ f^a A b A c^ 
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(A.37) 

The beam errors can therefore be correlated (and in fact are 
strongly so, see Planck Collaboration VII 2013). In the next 
Appendix, this general covariance matrix is used to derive the 
beam error eigenmodes of combined spectra for the CamSpec 
likelihood. 



A.7. Calibration and beam uncertainties for the CamSpec 
likelihood 

Four effective cross-spectra are used in the CamSpec likelihood, 
each using an individually -prescribed Grange. For each of the ef- 
fective power spectra, all eligible mask- and beam-deconvolved 
cross- spectra Cf Y are used, weighted according to 



C p 



-2> 



XY,p 



c 



XY 



(A.38) 



XY 



with p labelling the effective spectrum. As describe in 
Appendix A. 6, uncertainties in the determination of the HFI ef- 
fective beams are described in terms of beam eigenmodes, Ek, 
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Figure A.2. The eigenmodes of the beam covariance matrix, 
shown for each cross- spectrum used in the analysis. The largest 
five modes are shown for each cross-spectrum. 

and distributions of the coresponding eigenvalues. To propagate 
beam errors into the likelihood, we start by using the eigenval- 
ues, along with the a* Y weights, to construct an appropriate co- 
variance matrix for the effects of beam errors in ))beam • 



^ l ^"))beam 



4 Z 2j 2j a * Y ' P a ™' P E ' Y{e) E ? in R f zw > 

ij XY ZW 

(A.39) 

where R XY,ZW is the correlation between the i-th eigenmode of 
the XY cross-spectrum with the j-th eigenmode of the ZW cross- 
spectrum. The portion of this matrix corresponding to the £- 
range used in the likelihood is then extracted, and itself singular- 
value-decomposed. We keep the first ft e ffmodes (typically five) ei- 
genmodes Ef {I), i = 1, . . . , ft e ffmodes> orthogonal over the Grange 
and normalized such that the sum of their outer product directly 
approximates the covariance: 



((e p € e*)>^*YEf(£)Ef(n 



(A.40) 



The eigenmodes are illustrated in Fig. A.2. 

Next we calculate a suitable covariance matrix between 
the eigenmodes. This requires the (non-diagonal in €) inter- 
effective- spectrum covariance matrices {{C p C^,))beam> given by: 



{{C P „ C?,»be; 



XY,p a ZW,q E XY 



(l)E zw {l')Rf zw . 



ij XY ZW 

(A.41) 

Given these matrices, we can "stack" the effective spectra to 
form a data vector X and form a grand beam-covariance mat- 
rix «XX T » beam . X has length n x = Z P (C^ ~ Cn + ! )- Zero " 
extending each eigenmode at both ends, and arranging these into 
a matrix, we can form an nx by ft e ffmodes • ^eff matrix of extended 
eigenmodes, E e s, where n e ff = 4. Now we imagine approximat- 
ing «^^ T ))beam as a correlated outer product of E&, 

((X X T » beam * E eS M eS E T Qff . (A.42) 

Requiring that the covariance be chosen to minimize the 
summed squared-difference between elements on the two sides 
yields: 

M eff = £ e T ff «XX T » beam £ eff . (A.43) 
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Map 



Mask 



N T 



100-dsl 
100-ds2 



2.717 x 10" 4 
1.144 x 10- 4 



143-5 
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143-6 


L 6.881 x 10 J 


1 

LHrD- 1 


J.Uo^ X 1U 


143-dsl 


L 2.824 x 10- 5 


143-ds2 


L 2.720 x 10" 5 


217-1 


L 1.159 xlO" 4 


217-2 


L 1.249 xlO" 4 


217-3 


L 1.056 xlO" 4 


217-4 


L 9.604 x 10" 5 


217-dsl 


L 6.485 x 10" 5 


217-ds2 


L 7.420 x 10- 5 



Table A.l. Noise estimates for the detector maps, applying 
the Galactic masks used in the CamSpec likelihood ('mask_3' 
for 100 GHz, retaining 58% of the sky, and 'mask_l' for 143 
and 217 GHz, retaining 37 % of the sky, combined with an ex- 
tragalactic point source mask). 



The Efffls and the associated covariance M e ff are then passed 
to the likelihood. 





A.8. Noise model of HFI detector sets 

For strictly uncorrected pixel noise (o~J) 2 , and pixel weights w/, 
the noise power spectra for the temperature maps are: 



N 1 



(A.44) 



Figure A.3. Noise spectra computed from difference maps for a 
selection of detector sets: 100 GHz (top), 143 GHz (middle), and 
217 GHz (bottom). Green lines show the spectra for T, purple 
for Q and magenta points for U. The solid black lines show the 
modeled fits to the spectra (Eq. A.46), and the coloured hori- 
zonal lines show the white-noise levels of Eq. A.44. These are 
computed using the same masks as used in Table A. 1 . 



with contribution to the pseudo-C^ estimates, for uncorrelated 
noise, of 

^ r = ^Z ( ^ )2w ^- (A - 45) 



N 1 



Values for N T are listed in Table A.l. There is a significant dis- 
persion in the noise properties of the two maps at 100 GHz. At 
143 and 217 GHz, the PSB maps have significantly lower noise 
than the SWB maps by a factor of two, as expected. 

The noise spectra for the Planck HFI maps are non- white. 
The following 7-parameter function 



100 



ffii/ioooy 3 



(A.46) 



is a flexible parameterization that provides accurate fits for all 
of the HFI channels. The first term on the left models the excess 
'l/f'-like noise while the second term models the 'bell shaped' 
noise spectrum at high multipoles introduced by time constant 
deconvolution applied to the time-ordered data, and the low-pass 
filter designed to remove high-frequency noise due to demodu- 
lation. Estimates of the noise spectra can be computed from 
difference maps constructed from different half-ring surveys 13 . 
Examples of fits to noise spectra for the 143 GHz and 217 GHz 
channels are shown in Figure A. 3. Note that the 100 GHz noise 



13 As described in the HFI Data Processing paper 
Planck Collaboration VI (2013), these difference maps provide an 
estimate of the noise level in the sum maps with an accuracy of about 
2%. 



spectra are significantly non- white. At 143 and 217 GHz, the de- 
viations from white noise are smaller. Since these cross spectra 
contribute almost all of the weight in the likelihood at high mul- 
tipoles, the modelling of non- white noise is not a critical factor 
in forming an accurate likelihood. 

We adopt a heuristic approach to fold departures from white 
noise into the power spectrum covariance estimates. We define a 
set of noise weight functions, e.g., 



_e 

NT ' 



(A.47) 



Wherever a <x 2 term appears in a covariance matrix, we multiply 
the appropriate coupling matrix by a factor 



(A.48) 



This heuristic approach can be partially justified by noting that 
for isotropic Gaussian noise over the full sky, the distribution of 
Q is given by the inverse Wishart distribution: 



dP{Ct\Ct) oc |W,|-(^)exp 



1 



where 



W/ 



1 



-(c T 



TT + N TT 



(A.49) 



(A.50) 



e.g., Percival & Brown (2006). In this special case, our heuristic 
correction is exact. Further justification of the accuracy of this 
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Table B.l. Series of Galactic and cosmology masks. 



Series 


Galactic Mask 


Apodised Galactic masks 


Cosmology Mask 





G22 


GA21 




1 


G35 


GA34 


CL31 


2 


G45 


GA38 


CL39 


3 


G56 


GA54 


CL49 


4 


G65 


GA60 





heuristic approach comes from direct comparisons with numer- 
ical simulations incorporating non- white noise (see Sect. 6) and 
from the accurate agreement of co variance matrices with the i- 
by-£ scatter measured in all of the cross-spectra used to form the 
likelihood. 



Appendix B: Sky masks 

We apply a threshold to the 353 GHz temperature map to define a 
set of diffuse Galactic masks shown in Fig. B.l. We refer to them 
using the percentage of the sky retained: G22, G35, G45, G56, 
G65. We also use a point source mask, labeled PS 96, which is 
based on the union of the point sources detected from the chan- 
nels in the range 100 to 353 GHz 

In order to avoid power leakage, we also derive a series of 
apodised masks. For the Galactic masks, we proceed as follows. 
First, we smooth each mask with a five-degree Gaussian beam, 
and zero any pixels below a threshold of 0.15. We then subtract 
0.15 from the remaining pixels, and rescale the resulting map 
by 1/(1 - 0.15). The resulting masks are shown in Fig. B.2. In 
order to retain sufficient sky area for the most conservative sky 
mask, a slightly less aggressive version of mask G22 was used 
to seed the apodization process for that case. Each point source 
is apodised to 30' FWHM, resulting in the PSA82 point source 
mask. A set of the resulting masks are shown in Fig. B.2. 

For all cosmology analyses, we use three of the apodised 
galactic and point sources masks: CL31, CL39, and CL49, which 
are shown in Fig. 2. Table B.l summarizes the various masks. 
For a limited set of tests in §3.2 we also used 'maskO' and 
'maskl' which combine the non-apodised Galactic masks G22 
and G35 with the apodised PSA82 point source mask. 



Appendix C: Chance correlations and 
inter-frequency consistency tests 

Here we explicitly show that, even if the foreground contamin- 
ation is much smaller than the CMB, chance cross-correlations 
can produce scatter in the inter-frequency power spectra that is 
large in the signal dominated regime. To see this, consider the 
case of two frequencies. Frequency 1 provides a faithful map of 
the CMB fluctuations. Frequency 2 contains a foreground com- 
ponent F. We therefore write the maps at the two frequencies 



as: 

X! = S, (C.2) 

X 2 = S + F, (C.3) 

with spherical transforms 

a \m ~ Sim, (C.2) 

a L ~ Slm + Ff m . (C.3) 



The power spectra of the two maps are therefore: 

c \ = w + 1) 2j S{mS *t™ = C? MB , (C.2) 

m 

C * = (2€ + 1) Y^ S{m + F ^ S €m + F U 
m 

= Q CMB +2Q CMBxF + Q F , (C.3) 

and the difference between the power spectra is 

Q 2 -C]=2Q CMBxF + C F . (C.l) 

If the CMB is uncorrected with the foreground, the first 
term will average to zero over a large number of CMB realiz- 
ations. But we observe only one realization of the CMB, and so 
the cross-term will dominate the inter-frequency residuals even 
if the foreground contamination is much lower than the CMB 
(C F «: C CMB ). This is the origin of the excess scatter between 
the 143 and 217 GHz power spectra at low multipoles shown in 
Fig. 10. 

We construct a specific example of this. The upper map in 
Fig. C.l shows an ILC map, estimating the CMB, generated 
from the 100, 143, 217, and 353 GHz maps. The map in the 
middle panel shows a 'fake' 217 GHz map, i.e. the sum of the 
ILC map and the 857 GHz map scaled in amplitude to match 
dust emission at 217 GHz. The real 217 GHz map is shown in 
the lower panel of Figure C.l. The 'fake' 217 GHz map is evid- 
ently quite a good match to the real 217 GHz. By rescaling the 
857 GHz map to estimate the dust emission at 143 GHz, we 
can generate a 'fake' 143 GHz map in an analogous way. These 
'fake' maps each contain two components by construction, and 
so the inter- frequency residuals from these maps will be domin- 
ated by the CMB -foreground cross term in Eq. C.l. 

The 143 and 217 GHz power spectrum difference from these 
fake maps are compared to the 143-217 residuals of the real data 
in Fig. C.2. The magenta points show the same maskl-maskO 
double-difference power spectrum between 217 and 143 GHz as 
shown in Fig. 3. The only difference here is that the smoothed 
dust fit of Eq. 9 has been subtracted from the spectra so that 
the points scatter around zero. There are advantages to using 
the double difference because: (a) it is insensitive to calibration 
differences between frequencies; (b) the contrast between dust 
emission and other foregrounds (point sources/SZ) is stronger 
in the area of sky defined by maskl - maskO and so the double 
difference power spectrum should be closer to the results from 
the fake maps, which use only a dust template 14 . The solid green 
line shows the double difference power spectrum computed from 
the fake maps. The amplitude of the scatter from the fake maps 
and the real data are very similar. In fact, there is almost point- 
by-point agreement between the results from the real data and 
the fake maps. This provides compelling evidence that the ob- 
served inter-frequency scatter at low multipoles is dominated 
by the CMB -foreground cross term in Eq. C.l rather than some 
mysterious systematic effect in the data. 

The blue points in Figure C.2 show the difference of the 217 
and 143 GHz power spectra for maskl. The scatter at multipoles 
< 100 is almost identical to the scatter of the purple points, 



14 Actually, as demonstrated in Fig. 5, the CIB dominates over 
Galactic dust emission over most of the area of maskO, but this will 
not be a precise template for the CIB emission at cosmological chan- 
nels: (a) because the spectrum of the CIB differs slightly from Galactic 
dust; (b) the CIB emission decorrelates from high to low frequencies 
because lower frequencies probe galaxies at higher redshifts. 
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Figure B.l. The set of unapodised foreground masks, G22, G35, G45, G56, G65, PS96, which, once apodised, are used for the 
likelihood analyses. These Galactic masks are defined using a threshold of the 353 GHz Planck temperature map. 



but increases slightly at higher multipoles. This behaviour is ex- 
pected and is caused by the additional foreground components 
(CIB/point sources/SZ) which become comparable in amplitude 
to Galactic dust at multipoles greater than a few hundred. 

We would also expect a strong dependence of the inter- 
frequency residuals with frequency at low multipoles. Diffuse 
Galactic emission rises steadily in amplitude from 100 GHz to 
217 GHz and hence we would expect the inter- frequency scat- 
ter to rise as we go up in frequency. This is what we see in the 
real data (shown in Fig. C.3). Since diffuse Galactic emission is 
well approximated by the 857 GHz map at all frequencies, we 
can predict the scatter seen in this figure by scaling 857 GHz 
to lower frequencies. For 143-217 GHz we observe a scatter 
of 16 OuK) 2 over the multipole range 50 < i < 500. So, from 
the 857 GHz scalings to lower frequencies we predict the scat- 
ter given in the Table, which is in excellent agreement with the 
scatter seen in Figure C.3. 
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Appendix D: Validity tests 

DA. Detailed Validity checks 

In this appendix we show the full set of figures for the distribu- 
tion of the cosmological and foreround model parameters for the 
suite of tests described in Sect. 7. We also show the correlation 
matrix between these parameters and the calibration coefficents 
of each detector. 

D.2. Cosmological parameters from Planck 70 GHz data 

For this analysis we implement the pseudo-Q method described 
in Hivon et al. (2002) extended to derive both auto- and cross- 
power spectra from the 70 GHz maps (see, e.g. Polenta et al. 
2005, for a comparison between the two estimators). The noise 
power spectrum and the covariance matrix are computed using 
1000 realistic Planck simulations (FFP6, Planck Collaboration I 
2013) of both signal and noise maps. The beam window 
functions are presented in Planck Collaboration IV (2013), and 
mode-coupling kernels to correct for incomplete sky cov- 
erage are computed from formulae analogous to those in 
Appendix. A.l. 

In Fig. D.9 we show the auto- and cross-power spectra com- 
puted from the 70 GHz maps, where cross-spectra are obtained 
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Figure B.2. Apodised Galactic and point source masks which we combine to use in the likelihood analysis. From left to right and 
top to bottom, the panels show the GA34, GA38, GA54 and PSA82 masks. 



by cross-correlating maps from different pairs of horns (there 
are three such pairs in total). We use these to construct a like- 
lihood at £ > 49 by assuming a Gaussian distribution for the 
band-powers, and include the covariance matrix estimated from 
simulations. To estimate cosmological parameters we use this 
likelihood in combination with the Planck low-£ likelihood. We 
marginalize over a single extragalactic foreground parameter, 
which is a Poisson term C^ modeling unresolved residual point 
sources. Fig. D.10 shows the resulting parameters, compared to 
those from CamSpec. Considering the different £ range contrib- 
uting to the two analysis, the parameter distributions are consist- 
ent. In Fig. D.l 1, we show 70 GHz parameters for three choices 
of the maximum £ considered: £ max = 800, 1000, and 1200. The 
latter two are consistent. Minor discrepancies are displayed at 
£max = 800, which can be explained since there is, in this range, 
no detection of the point source component. In the same fig- 
ure, we show results using the Plik likelihood for £ max - 1008, 
which is consistent with Planck 70 GHz over the same £ range. 

D.3. Consistency of the Planck low resolution CMB maps 

Here we extend the discussion presented in Sect. 8.3. In Fig. 
D.12 we show the power spectrum of the residual maps, relative 
to Commander, for NILC, SEVEM, and SMICA. The maximum dis- 
crepancy in the range £ < 40 (the multipole where noise begins 
to become non-negligible) is localized at the quadrupole and is 
less than 20 yuK 2 , whereas for the range 3 < £ < 40 the differ- 
ences are of order « 5/dK 2 . Overall, the Planck maps are more in 
agreement among themselves than with WMAP, except perhaps 
at the quadrupole. The residual map between WMAP and Planck 
shows power spectrum residuals from ^10 jiK 2 up to « 40 jjK 2 
to £ < 40. These figures should be compared to the residual es- 
timated from simulated foreground maps, shown to be < 10 yuK 2 
at £ < 70 in Planck Collaboration XII (2013). 

We complement our results with those obtained from 
the 'FFP6' simulations described in Planck Collaboration XII 



(2013). They consist of 1000 signal plus noise maps processed 
through each of the four component separation pipelines. 

For each Monte Carlo realization, we follow the same pro- 
cedure as in the previous section, i.e. smoothing (FWHM = 
329. 8 Y) and binning the maps to A^de = 32. We apply this pro- 
cedure to both the CMB input maps and the output maps derived 
by the four component separation algorithms. Again, a Gaussian 
white noise with a variance of 4 jiK 2 is added, and the noise cov- 
ariance matrix is corrected accordingly. Note that the additional 
white noise is taken into account not only for numerical regular- 
ization (to this extent its amplitude may well be lower), but prin- 
cipally because the output instrumental noise processed through 
component separation and downgraded to low resolution is far 
from being white. The additional white noise makes the detailed 
knowledge of the full noise covariance matrix unimportant. 

For each realization and for each component separation 
code, we compute the power spectrum of the processed map and 
of the input signal map, using a common mask. When estimat- 
ing the power spectrum, the input CMB maps are regularized by 
adding a negligible Gaussian white noise with 0.1 yuK 2 variance. 
We have checked explicitly that when the white noise is added 
to the component separated maps, we are able to recover the in- 
put power spectrum without bias up to £ ~ 60 for all the four 
component separation methods. 

Moreover, we compute the power spectrum of the difference 
maps (output processed map minus input CMB) for each realiza- 
tion, in order to evaluate the total amount of residual noise. Note 
that this is not only given by the added regularization noise, but 
also from the intrinsic noise, albeit small, that is present in the 
maps. 

Fig. D.13 shows the average and the Icr levels of such noise 
residuals for each of the component separation solutions. We 
thus see, that the level of such total noise residuals is well be- 
low the difference plotted in Fig. D.12. Therefore, we argue that 
the existing differences between the codes are due to genuine 
foreground separation residuals. 
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ILC -1-857 dust template 




317 dx7 map 




Figured. Top: An ILC CMB map constructed from 100- 
353 GHz maps. Middle: The ILC map added to the 857 GHz 
map, scaled to match the diffuse dust emission at 217 GHz. This 
map (and an equivalent at 143 GHz) is used as the CMB+dust 
template to assess CMB/foreground cross correlations. Bottom: 
The real 217 GHz map. 



Appendix E: Dust cleaning using Planck 353 GHz 

The WMAP polarization products are weighted combinations of 
Ka, Q, and V band 9 year maps. The WMAP analysis mitig- 
ates polarized foreground emission using template fitting, with 
synchrotron and dust foregrounds. As a template for synchro- 
tron emission, the WMAP K band channel is used, and for dust a 
polarization model is used to create a template map (Page et al. 
2007). Here we assess the impact on the WMAP polarization sig- 
nal when this dust template is replaced by the Planck 353 GHz 
map, which can be assumed to trace the dust better. This is 
only for comparison; we continue to use the WMAP polariza- 
tion products as released by the WMAP team (except using the 
Planck a^T map as discussed in Sect. 8). 
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Figure C.2. The magenta points show the double difference 
power spectrum (as in Figure 3. The green line shows the same 
double difference spectrum computed from the ILC + 857 dust 
template maps described in the text. The blue points show the 
difference of the 217 and 143 power spectra for mask_l. In all 
cases, the smoothed dust power spectrum model of equation (9) 
has been subtracted. 
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Figure C.3. Power spectrum residuals between the summed 
cross- spectra at three HFI frequencies. The top panel shows 
100 x 100 vs 143 x 143, the middle panel shows 100 x 100 vs 
217x217 and the bottom panels shows 143x143 vs 217x217. A 
'best fit' model for unresolved foregrounds has been subtracted 
from the power spectrum at each frequency (see Section xxx for 
further details). The scatter in the multipole range 50 < £ < 500 
is listed in each panel. 



Foreground cleaned maps can be written as 

niclean = m * ~~ ^/ m synch - A' m dust 

where m = (Q,U) are linear polarization Stokes parameter 
maps, and the index i is for Ka, Q, and V bands. Here m sync h 
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Figure D.l. Impact on cosmological and foreground parameters of fixing the calibration and beam coefficients at their maximum 
posterior value (red), compared to marginalizing over these nuisance parameters (blue). 



is the WMAP 9-year K band map and for m dust we use either the 
WMAP dust template or the Planck 353 GHz maps. For each fre- 
quency band, the scaling coefficients or* and fa are estimated by 
minimizing the^ 2 : 



X 2 (a, P) = m^ lean C Viean 



(E.l) 



where C is the covariance matrix. Following the WMAP analysis, 
we do not include the signal contribution in C and we only use 
the diagonal elements of the noise covariance matrix to estimate 
X 2 - Scaling coefficients are computed using two different WMAP 
masks: the 'processing mask' that masks a narrow region in the 



plane of the Galaxy, and the more conservative 'P06' mask used 
for power spectrum estimation and cosmological analysis. 



The template coefficients at each channel are shown in 
Fig. E, estimated using the two different Galactic masks. At Q 
and V band the estimated coefficients are consistent for the two 
masks; at Ka band the 353 GHz map gives more consistent res- 
ults than the WMAP dust template. The coefficients using the 
P06 mask are more uncertain however, as the residual dust signal 
outside the mask is low, especially for Ka band. We find that the 
preferred synchrotron coefficient, a, is slightly lower using the 
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Figure D.2. Covariance matrix between cosmological parameters and detector cross-calibration coefficients (the calibration of the 
143-5 detector is set to 1 to avoid an overall degeneracy with the total signal amplitude). 



Table E.l. Reduced^ 2 values obtained from Eq. E.l for map 
pixels outside the WMAP 9-year processing mask. The number 
of d.o.f. is 5742. 





Ka 


Q 


V 


Planck 353 GHz 


1.127 


1.132 


0.991 


WMAP dust model 


1.135 


1.149 


1.030 



353 GHz map, and the overall^ 2 , shown in Table E, is slightly 
improved using the Planck dust map. 

We test the effect on cosmological parameters, in particular 
the optical depth to reionization, using the two different tem- 
plates. Using the Planck 353 GHz channel as the dust template, 
with coefficients estimated using the processing mask, lowers 
the best fit value of r by about lcr (see Fig. E.2). we find 
r = 0.075 ± 0.013, compared with r = 0.089 ± 0.013 using 
the WMAP dust model. This also has the effect of lowering A s , 



from 3.088+0.025 to 3.061 ±0.025, but other ACDM parameters 
are not affected. We note though that using template coefficients 
estimated outside the P06 Galactic mask, the optical depth us- 
ing the Planck template is lowered by only 0.5<x compared to 
the WMAP template, indicating some spatial dependence. We 
conclude that the impact on cosmological parameters from the 
choice of dust template is not significant, but anticipate more 
extensive analysis with the full Planck polarization data set. 
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Figure D.3. 



». Covariance matrix between foreground parameters and detector cross-calibration coefficients, as in Fig. D.2. 
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Figure D.5. Impact on the cosmological and foreground parameters of varying the maximum multipole, £ n 
1008 (blue), 1503 (purple), and 2013 (green); compared to the reference l max - 2508 (red). 
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Figure D.7. Impact of changing the Galactic mask, increasing the sky area used from G22 (blue), G35 (purple), G45 (red, reference), 
to the least conservative G56 (green). All results use the 100 < £ < 2508 range. Note that the CamSpec likelihood uses the G35 
mask for the 143 and 217 GHz channels, and G56 at 100 GHz, but with a restricted, composite multipole range. 
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Figure D.8. Impact on parameters of removing one single frequency channel (i.e., all spectra with at least one frequency in the 
removed channel). Results are shown removing the 100 GHz (green), 143 GHz (purple), or 217 GHz (blue) channels, compared to 
the reference case (red). Where the 217 GHz channel is removed, the CIB spectral index is held fixed at y cm = 0.6. 
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i 

Figure D.9. The power spectrum from the Planck 70 GHz chan- 
nel, before removal of unresolved sources. Both the auto- 
spectrum from the 70 GHz maps, and the weighted cross spectra 
from maps of the three 70 GHz horn pairs are shown. The best- 
fitting cosmological model from CamSpec, shown for compar- 
ison with the best-fitting source power added, gives a good fit to 
the data. 
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Figure D.10. Cosmological parameters derived from the 70 GHz maps (solid black) are compared to CamSpec results (red dashed). 
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Figure D.ll. Cosmological parameters derived from the 70 GHz maps for different values of the maximum multipole - £ max = 800 
(green dotted), £ max = 1000 (blue dashed), and £ max = 1200 (solid black) - are compared to Plik at £ max = 1008 (pink dot-dashed). 
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Figure D.12. The power spectrum of the residual CMB maps, re- 
lative to Commander, for the NILC, SEVEM, and SMICA methods. 
The lcr expected noise level is shown in black. 
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Figure D. 13. Estimated total residual noise (intrinsic and regu- 
larizing white noise) levels for each of the four Planck CMB 
maps: Commander, Nile, Severn, and Smica. Solid lines show 
the average and the hatched regions show the 68% CL. 
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Figure E.l. l<x and 2<x contours for the template coefficient scalings estimated using the WMAP dust template (top) and the Planck 
353 GHz map as a dust template (bottom), for the Ka, Q, and V bands. We compare the coefficients estimated using the WMAP 
'P06' mask, to this with the smaller WMAP 'processing mask'. We also indicate the template values quoted in the WMAP paper 
Bennett et al. (2012). 
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Figure E.2. ID (top) and 2D (bottom) posterior probability for 
r and the combination r - A s , for two different Galactic dust 
templates. These are computed using dust template coefficients 
estimated with the WMAP 'processing' mask. The difference in 
r is reduced to ~ 0.5<x if template coefficients are estimated out- 
side the 'P06' Galactic mask. 
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